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We consider a diffusion on a potential landscape which is given by a smooth 
^ Hamiltonian H : R" — )■ R in the regime of small noise e. We give a new proof 

of the Eyring-Kramers formula for the spectral gap of the associated genera- 
te tor L = eA — VH ■ V of the diffusion. The proof is based on a refinement of 
I— —I the two-scale approach introduced by Grunewald, Otto, Westdickenberg, and 
Villani [GOVW09] and of the mean-difference estimate introduced by Chafai 
and Malrieu [CMIO]. The Eyring-Kramers formula follows as a simple corol- 
la lary from two main ingredients: The first one shows that the spectral gap 
of the diffusion restricted to a basin of attraction of a local minimum scales 
^ nicely in e. This mimics the fast convergence of the diffusion to metastable 
states. The second ingredient is the estimation of a mean-difference by a new 
I weighted transport distance. It contains the main contribution of the spec- 
tral gap of L, resulting from exponential long waiting times of jumps between 
I metastable states of the diffusion. 

in 

^ MSG: Primary 58J65; secondary 60J45; 60J60; 35P15; 49Q20. 

CN| Keywords: Eyring-Kramers formula, diffusion process, Kramers law, metastability, Poin- 

care inequality, spectral gap, weighted transport distance. 



1. Introduction and main results 

We consider a diffusion on a potential landscape which is given by a smooth Hamiltonian 
H : R" — )■ R. We are interested in the regime of small noise e. The generator of the 
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diffusion lias the following form 



L = eA-VH -v. 



(1.1) 



The associated Dirichlet form is given by 




(1.2) 



The corresponding diffusion satisfies the stochastic differential equation 



d6 



(1.3) 



where Bt stands for the Brownian motion on R". The last equation is often also called 
over-damped Langevin equation. Under some growth assumption on H (cf. Section 1.1) 
there exists an equilibrium measure of the according stochastic process, which is called 
Gibbs measure and is given by 



We are particularly interested in the case where H has several local minima. Then, the 
process shows metastable behavior in the sense that there exists a separation of scales. 
On the fast scale, the process converges quickly to a neighborhood of a local minima. On 
the slow scale, the process stays there for an exponentially long waiting time after which 
it jumps to another local minima. 

This behavior is well known in the context of chemical reactions. The exponential waiting 
time follows the Arrhenius law [Arr89] meaning that the mean exit time from one local 
minima of H to another one is exponentially large in the energy barrier between them. 
By now, the Arrhenius law is well-understood even for more complicated systems by the 
Freidlin-Wentzell theory [FW98], which is based on large deviations. 

A refinement of the Arrhenius law is the Kramers law which additionally considers 
pre-exponential factors. The Kramers law goes back to Eyring [Eyr35] and Kramers 
[Kra40], which considered the one-dimensional situation. Both argued that also in high- 
dimensional problems of chemical reactions most reactions are nearby a single trajectory 
called reaction pathway. Evaluating the Hamiltonian along this reaction coordinate gives 
the classical picture of a double well potential in one dimension with an energy barrier 
separating the two local minima for which explicit calculations are feasible. 

A rigorous proof of the Kramers law for the multidimensional case was open for a long 
time until Bovier, Eckhoff, Gayrard, and Klein [BEGK04, BGK05] obtained first order 
asymptotics that are sharp in the parameter e. They also clarified the close connec- 
tion between mean exit times, capacities and the exponentially small eigenvalues of the 
operator L given by (1.1). To set the nomenclature, for us the Eyring-Kramers for- 
mula is a sharp asymptotics for the exponentially small eigenvalues. The main tool 
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of [BEGK04, BGK05] is potential theory. The small eigenvalues are related to the mean 
exit times of appropriate subsets of the state space. Further, the mean exit times are 
given by Newtonian capacities which can explicitly be calculated in the regime of small 
noise e. 

A httle bit later Helffer, Klein, and Nier [HKN04, HN05, HN06] also deduced the Eyring- 
Kramers formula using the connection of the spectral gap estimate of the Fokker-Planck 
operator L given by (1.1) to the one of the Witten Laplacian. This approach makes it 
possible to get quantitative results with the help of semi classical analysis. They deduced 
sharp asymptotics of the exponentially small eigenvalues of L and gave an explicit ex- 
pansion in e to theoretically any order. For a nice overview on the Kramer's law we refer 
the reader to the article [Berll]. 

The aim of this article is to provide a new self-contained proof of the Eyring-Kramers for- 
mula for the first eigenvalue. Functional inequalities are the background of our approach. 
Our proof is based on two-basic ingredients: 

• The first one is a refinement of the two-scale approach of Grunewald, Otto, West- 
dickenberg, and Villani [GOVW09] leading to a good Poincare inequality for Gibbs 
measures restricted to a basin of attraction of a local minima (cf. Theorem 1.7). 
This yields a good spectral gap for the diffusion restricted to a basin of attraction 
and mimics the fast convergence of the diffusion to metastable states. 

• The second one is a refinement a transport technique mentioned by Chafa'i and 
Malrieu [CMIO] leading to a mean-difference estimate (cf. Theorem 1.9). It con- 
tains the leading order contribution in e of the spectral gap of L, resulting from 
exponential long waiting times of jumps between metastable states of the diffusion. 

From these two ingredients, the Eyring-Kramers formula for the first eigenvalue directly 
follows as a simple consequence (cf. Corollary 1.10) using again the refinement of the 
two-scale approach. 

The remaining part of the introduction is organized as follows: In Section 1.1, we state 
the precise assumptions on the Hamiltonian H. Whereas in Section 1.2, we state the main 
results of the article and outline the main idea of the new proof of the Eyring-Kramers 
formula. 

1.1. Assumptions on the Hamiltonian H 

Before starting the precise assumptions on the Hamiltonian H, we are introducing the 
notion of a Morse function. 

Definition 1.1. A smooth function H : R" — t- R is a Morse function, if the Hessian 
V^-ff of H is non-degenerated on the set of critical points. More precisely, for some 
1 < C < oo it holds 
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We make the following assumption on the Hamiltonian H. 

Assumption 1.2. We assume that H G C^(]R",E,) is a Morse function. Further, for 
some constants Ch > and Kh > it holds 

liminf |Vi7| > Ch. (A1) 

limm{\VH\'^ - AH > -Kh. (A2) 

The Assumption 1.2 yields the following consequences for the Hamiltonian H: 

• The condition (Al) ensures that is integrable and therefore can be normalized 
to a probability measure on R^. Hence, the Gibbs measure /i given by (1.4) is well 
defined. 

• A combination of the condition (Al) and (A2) ensures that there exists a spectral 
gap for the operator L given by (1.1). For the definition of the spectral gap and 
for the argument, we refer the reader to the Appendix C. Equivalently, this means 
by the variational characterization of the spectral gap of L (cf. Lemma 1.5) that 
the Gibbs measure given by (1.4) satisfies a Poincare inequality for sufficiently 
small e. 

• Basically, the Lyapunov-type condition (A2) allows to recover the spectral gap of 
the full Gibbs measure fi from the spectral gap of the Gibbs measure restricted 
to some ball Br with large but fixed radius R> (cf. Section 4.1 or [BCG08]). 

• The Morse assumption (1.5) together with the growth condition (Al) ensures that 
the set S of critical points is discrete and finite. In particular, it follows that the 
set of local minima Ai = {mi, . . . , itim} is also finite i.e. M := < oo. 

For the proof of the Eyring-Kramers formula we need an additional non- degeneracy 
assumption concerning the values of H{x) at critical points x G S. For that purpose, let 
us introduce the saddle height H{mi, mj) between two local minima rm, mj by 

H{mi,mj) =inf| max H{-f{s)) : 7 G C([0, 1], R"), 7(0) =771^,7(1) = mj 

Assumption 1.3. There exists 6 > such that: 

(i) The saddle height between two local minima nii, mj is attained at a unique critical 
point Sij G S, i.e. it holds H{sij) = H{mi,mj). 

It follows from Assumption 1.2 that Sij is a saddle point of index one, i.e. 
{x S IR" : (V^-fr(sjj)x, < 0} is one- dimensional. 
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(ii) The set of local minima M = {mi, . . . , uim} is ordered such that mi is the global 
minima and for all i £ {3, . . . , M} yields 

H{si,2) - H{m2) > H{sia) - H{mi) - 5. 

The Assumption 1.2 and 1.3 are almost the same as the ones used by [BGK05]. However, 
there are two minor differences which are outlined in the next remark. 

Remark 1.4. We compare our conditions on H with the conditions of [BGK05]. 

• For convenience, we assume that the domain of H is R". As in [BGK05], our 
argument would also work for any open and connected subset domain D C R"" 
satisfying H{xi) — t- oo whenever Xj — ?■ x £ dD. 

• Note that the non-degeneracy assumption (1.5) holds for all critical points x of H. 
In [BGK05], the assumption (1.5) is only needed for the local minima m^ and the 
saddles Sjj. We need this slightly stronger assumption in the proof of the local 
Poincare inequality (cf. Theorem 1.7). In the argument of [BGK05] does not need 
a local Poincare inequality. 

1.2. Idea of the new proof of the Ey ring- Kramers formula and main results 

Due to the fact that our approach is motivated by functional inequalities, we are using the 
well-known variational characterization of the spectral gap by the Poincare inequality. 

Lemma 1.5 (Variational characterization of the spectral gap of L). We say that the Gihhs 
measure /i on K," given by (1.4) satisfies the Poincare inequality PI{q) with constant q > 
if for all functions f : K," — )• R 

var^(/) := I (^f- I fdf^j d/x < ^ ^ |V/|'d/i = ^£{f), Pl{g) 
where the Dirichlet form E is given by (1.2). 

Then, the operator L given by (1.1) has a spectral gap of size ge if and only if the Gibbs 
measure fi satisfies PI{q) with optimal constant g. 

The new proof is based on a refinement of the two-scale approach [GOVW09] and a 
transport technique introduced in [CMIO]. Motivated by the fast convergence of the 
diffusion given by (1.3) to metastable states, we decompose the Gibbs measure fj, into 
a mixture of local Gibbs measures fii in the following way. To every local minima mi £ Ai 
we associate its basin of attraction ilj defined by 

:= |y G R" : lim yt = m„yt = -VH{yt),yo = y} . (1.6) 



5 



1. Introduction and main results 



Up to sets of Lebesgue measure zero, the set Vm = is a partition of IR". To each 

element of the partition fij we associate the local Gibbs measure /Xj given by 

1 / H(x)\ 
Hi{dx) = -^-^ '^Qiix) exp —j dx, where Zi = iJ,{0.i). (1.7) 

Note that Zi = 1, since is a partition of R" and /x a probability measure. The 

starting point of our argument is a representation of the Gibbs measure /i as a mixture 
of the mutual singular measures /Xj, namely 

/i = Zifii H + Zm/J-m- 

This decomposition of fi gives rise to a decomposition of the variance var^(/) into local 
variances and a variance between the averages on Oj, which we will call mean-difference. 



Lemma 1.6. For all f : H" — )• R holds the splitting 

M M 

var^(/) = Y,Z^ var^,(/) + ^ E ^^^^^ (^^ J " E^./)' • (1-8) 

1=1 i=l j<i 

We call the term var^. (/) local variance and (E^-(/) — E^^(/))^ mean- difference. 

The statement of Lemma (1.6) is verified by an easy straightforward calculation, which 
we outline for the convenience of the reader in Section 2. 

The main results of this paper are good estimates of the right-hand side of (1.8) (cf. The- 
orem 1.7 and Theorem 1.9 below). The Eyring-Kramer formula is then a simple conse- 
quence of these estimates (cf. Corollary 1.10 below). 

Let us now turn to the estimation of the local variances var^. (/). From the heuristic 
understanding of the process given by (1.3), we expect a good behavior of the local 
Poincare constant for the local Gibbs measures fii, because it would resemble the fast 
convergence of to a neighborhood of the next local minimum. Therefore the local 
variances should not contribute to the leading order expansion of the total Poincare 
constant of fi. This idea is quantified in the following theorem. 

Theorem 1.7 (Local Poincare inequality). The local measures {fii}f^i, obtained by re- 
stricting fj, to the basin of attraction fij of the local minima nii (cf. (1.7) ), satisfy PI(^>i) 
with 

e-' = 0{e). 

Remark 1.8. Using the variational characterization of the spectral gap (cf. Lemma 1.5) 
one can easily see the following consequence of Theorem 1.7: The spectral gap the 
diffusion given by (1.3) reflected at the boundary of a basin of attraction is at 
least of order 1. This reflects the heuristic idea of a scale separation of the diffusion 
into a fast and a slow scale. 
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Even if there are simple heuristics for the validity of Theorem 1.7, we need an elaborated 
machinery for the proof. The reason is that our situation goes beyond the scope of the 
standard tools for Poincare inequalities. We outline the argument for Theorem 1.7 in 
Section 4. It is based on an explicit construction of a perturbation H of the Hamiltonian 
H (cf. Appendix A). 

Let us now turn to the estimation of the mean- difference (E^,(/) - . Note that 

the term 

M 

1=1 j<i 

can be interpreted as the variance of a Bernoulli variable with probabilities Zi, . . . , Zm- 
Therefore we expect to see in the estimation of (E^./ — E^j^./) the exponential long 
waiting times of the jumps of the diffusion between the basins of attraction fij. We 
have to estimate the mean-differences on the right hand side of (1.8) in terms of the 
Dirichlet form £. Practically, we have to find a good upper bound for the constant C in 
the inequality 

{^,^{f)-^,^{f)f <C j iV/l^d^. 

Therefore, we introduce in Section 3.1 a new transport distance between probability mea- 
sures, which allows variational characterization of the constant C. By an approximation 
argument (cf. Section 3.2) we give an explicit construction of a transport interpolation 
(cf. Section 3.3) estimating asymptotically sharp the constant C, namely 

Theorem 1.9 (Mean-difference estimate). The mean- differences between the measures 
/ij and satisfy 

(E^,(/)-E^^.(/))^< J|detV2g(.,,)|e^^(-'^^)i?(£) / |V/pd/., 

where X~{sij) denotes the negative eigenvalue of the Hessian V'^H{sij) at the 1-saddle 
Si J. The multiplicative error term i?(e) satisfies 

R{£) = 1 + 0(Vi|lne|^). 



The proof of Theorem 1.9 is carried out in full detail in Section 3. 

Now, let us turn to the Eyring-Kramers formula. In Section 2 we show that a com- 
bination of Theorem 1.7 and Theorem 1.9 with Lemma 1.6 immediately leads to the 
multidimensional Eyring-Kramers formula (cf. [BGK05, Theorem 1.2]). 

Corollary 1.10 (Eyring-Kramers formula). The measure /i satisfies Pl{g) with 



1 < _4!L_V^^^S3e^(-(-)--(-)) (l + 0(Vi|ln.|i) 
Q ~ |A-(si,2)| v^detV2if(m2) V vv i i ; 



:i.9) 



In [BGK05, Theorem 1.2]) it is also shown that the upper bound of (1.9) is optimal. In 
the context of the Arrhenius' law, our new argument for the Eyring-Kramers formula 
shows some affinity to the proceeding in article [GOV87]. 
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2. The Eyring-Kramers formula: proof of Corollary 1.10 

The aim of this section is the proof of Corollary 1.10. However, let us first deduce the 
decomposition result of the variance: 

Proof of Lemma 1.6. With the definition of the local Gibbs measure //j (cf. (1.7)), we 
get by straightforward calculation 

var^(/) = j f^fi -(^1 fdf?j =Y.^^j - (/ ^d-") 

= E f/ /'d/^^ - (/ /d^^) + fe ( / /d/^') ' - (E j /d^^) • 

=var^ - (/) 

It only remains to show the identity. 

Y,Z^(^fd^^X-(S2ZJ^fd^,X = Y,J2^^^^(^ /d^^-/ /d/^.Y- (2-1) 

i j i j<i 

For convenience, let us introduce the notation 
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By expanding the square, we get that the left hand side of (2.1) can be written as 
^ifi ~ ZiZjfifj = Zi{l — Zi)fi — 2 ZiZjfifj. 

i i,j i i j<i 

By using the identity 

1 - = ^ Zj, 

the first term on the right hand side of (2.1) can be rewritten as 

^ z,(i - Zi)n^ = E E ^^^^f^^ + E E = E E ^^^^ (J^^ + -^Z 

i i j<i i j>i i j<i 

Hence, we finally get the desired identity 

E ^^~f^' - (E ^if^) ' = E E ^^^^(z^' - ^f^fi + 



i j<i 



□ 



Proof of Corollary 1.10. We start from the decomposition of the variance into local vari- 
ances and mean-differences given by Lemma 1.6. Then an application of Theorem 1.7 
and Theorem 1.9 yields the estimate 

J^jp^^ < j:Z.var,,(/) + EE^^^. (E..(/) -E,^,(/))^ 

< O(^) + EE ff U-f VldetV^^(.,,)|ei^(-'^^)i^(.). ^'''^ 

There is only left to estimate the product ZiZjZ^. An asymptotic expansion of the 
partition sums Z^ and Z^. shows 

ry f ^ , ( {IT^E)^ ^, n+1 , \ H(m,) 

ZiZa = e edx= \ , ^ ' = + 0(e 2 ) e 

^ v/detV2/7(mi) ^ ^ 

where we used the normalization convention H{mi) = 0. Therewith, it holds the asymp- 
totic expansion 

ZjZ^ZjZ^ ^ (27r£) t ^ det V^ff (mj _^ /^x 

v^det V^H{mi) det V^H{mj) 



9 
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Combining this quantity with the mean-difference estimate of Theorem 1.9 leads to 

var;.(/) < 2^^detV2ff(mi) \detvm{sij)\ ^i(H(s,.,)-HM-Him,)) -^^^-^ 
f\Vf\^dfi-^f^^\Xjj{sij)\ y'detVm{mi)detVm{mj) ' 

(2.3) 

Using the property (ii) of Assumption 1.3, we see that the dominating term is given by 
i = 2 and j = 1- This leads to the Kramers-Eyring formula (1.9). □ 

Remark 2.1 (Higher exponential small eigenvalues). The statement of [BGK05, Theo- 
rem 1.2]) does not only characterize the second eigenvalue of L (i.e. the spectral gap) but 
also the higher exponentially small eigenvalues. These characterizations also pop up in 
our approach: The dominating exponential modes in (2.3) by setting j = 1 correspond 
for z = 2, . . . , M to the inverse eigenvalues of L. Hence we believe that our approach 
should allow to identify these exponentially small eigenvalues. By the variational char- 
acterization of eigenvalues of the operator L (cf. Lemma 1.5) one would have to restrict 
the class of test functions / to the orthogonal complement of the eigenspaces of smaller 
eigenvalues. 



3. The mean-difference estimate: proof of Theorem 1.9 

This section is devoted to the proof of Theorem 1.9. We want to estimate the mean- 
difference (lE^i/ — lE^j /) for i and j fixed. The proof consists of four steps: 

In the first step, we introduce the weighted transport distance in Section 3.1. This distance 
depends on the transport speed similarly to the Wasserstein distance, but in addition 
weights the speed of a transported particle w.r.t. the reference measure fx. The weighted 
transport distance allows in general for a variational characterization of the constant C 
in the inequality 

(E^,(/)-E^^,(/))'<c| iV/l^d^. 

The problem of finding good estimates of the constant C is then reduced to the problem of 
finding a good transport between the measures fii and w.r.t. to the weighted transport 
distance. 

For measures as general as and fij, the construction of an explicit transport interpo- 
lation is not feasible. Therefore, the second step consists of an approximation, which 
is done in Section 3.2. There, the restricted measures /Xj and fij are replaced by sim- 
pler measures Ui and vj, namely truncated Gaussians. We show in Lemma 3.6 that this 
approximation only leads to higher order error terms. 

The most import step, the third one, consists of the estimation of the mean-difference 
w.r.t. the approximations Ui and vj. Because the structure of and i/j is very simple, 
we can explicitly construct a transport interpolation between Vi and z^j (see Lemma 3.11 
in Section 3.3). 
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The last step consists of combining the results of Lemma 3.6 and Lemma 3.11 (cf. Sec- 
tion 3.4). 



3.1. Mean-difference estimates by transport 

At the moment let us consider two arbitrary measures uq <^ ^ and vi <C /i. The overall 
aim is to estimate the mean-difference 

(E,„(/)-E,,(/))2. 

The starting point of the estimation is a representation of the mean-difference as a 
transport interpolation. This idea goes back to Chafa'i and Malrieu [CMIO]. However, 
they used a similar but non-optimal estimate for our purpose. Hence, let us consider a 
transport map U between uq and i^i, i.e. the push forward of under the map U is given 
by C/((Z^o = ^1- Further, let (^s)^^^,!] t)e a smooth interpolation between the identity and 
the transport map f7, i.e. 



$0 = Id, = U, and («>s)po 



The representation of the mean-difference as a transport interpolation is attained by 
using the fundamental theorem of calculus, i.e. 



(E,,(/)-E,,(/))2= ^'^i^dsdzvo) = {^j'^ j\vf{^s),^s)Au,As^ 

At this point it is tempting to apply the Cauchy-Schwarz inequality in L?'{Auq x ds) 
leading to the estimate of Chafa'i and Malrieu [CMIO]. However, this strategy would not 
yield the pre-exponential factors in the Eyring-Kramer formula (1.9) (cf. Remark 3.2). On 
Stephan Luckhaus' advice the authors realized the fact that it really matters on which 
integral you apply the Cauchy-Schwarz inequality. This insight lead to the following 
proceeding 



(E,„(/) - E,,(/))2 = (^^' j (V/(cI>,), 4>,) dz.0 ds) 



2 



<(/iv/ij;i*,o*-i^d,d,)' 



Note that in the last step we have applied the Cauchy-Schwarz inequality only in L^(d/x) 
and that the desired Dirichlet integral J | V/|^ d^u is already recovered. 
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The prefactor in front of the the Dirichlet energy on the right hand side of (3.1) only 
depends on the transport interpolation ($s)^g[o,i] ■ Hence, we can infimize over all possible 
admissible transport interpolations and arrive at the following definition. 

Definition 3.1 (Weighted transport distance). Let be an absolutely continuous prob- 
ability measure on IR," with connected support. Additionally, let i'q and vi be two proba- 
bility measures such that vq <^ ^ and z^i <C /i, then define the weighted transport distance 
by ^ 

r^{uo,u,)=infl (^£\^sO^;'\^ds^ dfi. (3.2) 

The family ($s)gg[o,i] is chosen absolutely continuous in the parameter s such that <I>o = 
Id on supp and ($i)jfo = i^i- For a fixed family and (*I*s)sg[o,i] ^ point x £ supp fi 
the cost density is defined by 

A{x):= [ M^) ^s. (3.3) 

Jo 

Remark 3.2 (Relation of 7^ to [CMIO]). In general, the transport cost T^ii'o, I'l) is always 
smaller than the constant obtained by [CMIO, Section 4.6]. Indeed, applying the Cauchy- 
Schwarz inequality on L^(ds) in (3.2) yields 

where we used the assumption that Us <^ for all s G [0, 1] in the last — L°° estimate. 

Remark 3.3 (Relation of 7^ to the — Wasserstein distance W2 )• If the support of /i is 
convex, we can set the transport interpolation (<^s)ggjo,i] to the linear interpolation map 
$s(x) = (1 — s)x + sU{x). Assuming that U is the optimal Ty2-transport map between 
and i/i, the estimate in Remark 3.2 becomes 



T^ivQ^vi) < ( sup / 



'^[x)ds]wi{uo,yi). (3.4) 



Remark 3.4 (Invariance under time rescaling). The cost density A given by (3.3) is 
independent of rescaling the transport interpolation in the parameter s. Indeed, we 
observe that 

A{x)= f\^sO^-,\x)\us{x)ds= r\^fo{<^f)-\x)\,yf{x)dt, 
Jo Jo 

where = ^t/T a-iid uj' = Vt/T- 
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In this paragraph, we show that 7^(-, •) actually is a distance justifying the term weighted 
transport distance. It turns out that the distance 7^(-, •) is a metric on a subspace of the 
space of probability measures on R" with finite second moment. The main restriction is 
that the weighted transport distance is very sensitive to support constraints, which are 
difficult to check for interpolations between general measures. 

Proposition 3.5. Assume that jj, is absolutely continuous w.r.t. the Lehesgue measure. 
Additionally, assume that fj, has convex support and finite second moment. Then 7^(-,-) 
is a distance on the space 

:= {v G ^(111") : 1/ <C /i, suppz^ compact] . 



Proof of Proposition 3. 5. The symmetry follows from the observation that if s i— )■ $ <j is 
an optimal interpolation between a-nd z^i, then s i— )• ^i-s is an optimal interpolation 
between i^i and fg- 

Let us consider the definiteness. Therefore, we assume that 7^(fo,i^i) = 0, then 

<I>s o -;— ds = 0, /x-a.e. 
d/i 

Hence, by integrating w.r.t. /i and interchanging the order of integration we obtain 
0=/ [\^soK^\^dnds= [ [\^sO'^s^\dusds= [ [\^s\duods. 



JO J dfj, JQ J JQ J 

This shows that = on supp i^q. Therefore it holds <I><j = Id |suppi/o resulting in Vs = '^o 
for all s G [0,1]. 

Let us consider the triangle inequality. We have to show that for arbitrary measures 
^0) 1^1/2) ^1 ^ A* it holds 

T^ivQ.vi) < Tfj,{i^o,ui) + (3.5) 

2 2 

Let ($s)sg[o,i] be an interpolation between vq and vi such that [0, 1] 9 s i— >• $s/2 is an 
interpolation between vq and and [0, 1] 9 s i— ?■ ^*s/2+i/2 is an interpolation between 
1^1/2 and vi. An application of the triangle inequality in L^(d/i) yields the estimate 

1 

2 



< 
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Because of the invariance of the cost density under rescahng of time (cf. Remark 3.4) 
and the arbitrariness of the transport (^s)se[o,i]) the last inequahty already implies the 
desired triangle inequality (3.5). 

In the last step we show that 7^(z^0) i^i) < oo for I'Oji'i G V^. For that purpose, we apply 
the bound in terms of the Wasserstein distance (3.4). Then, it is sufficient to show that 
v G Vfj, has finite second moment provided that fj, has finite second moment. This fact 
follows from the change of measure 



\x\ av 



2 dj^ , 

X -;— d/i < 

d/n 



dfj, 



\x\ dfi < CO. 



In the last inequality we used the observation 
compactness of the support of v. 



du 
dfi 



< OO, which holds due to the 

□ 



3.2. Approximation of the measures /i. 

In this subsection we show that it is sufficient to consider only the mean-difference 
w.r.t. some auxiliary measures approximating ^j. More precisely, the next lemma 
shows that there are nice measures i^i which are close to the measures in the sense of 
the mean-difference. 

Lemma 3.6 (Mean-difference of approximation). Let Vi he a truncated Gaussian centered 
around the local minima nii with covariance matrix Sj = {V'^H{mi))~^ , more precisely 

fj(dx) = — — e 2e lEi{x) dx, where Z^. = e 2= dx. (3.6) 

^i^i JEi 
Here and further on, we use the convention that for a matrix M and a vector x we write 

M[x] := {x,Mx) . 

The restriction Ei is given by an ellipsoid 



Ei = {x e : \T.. ^x - mi)\ < V2e uj{e)}. (3.7) 

Additionally, assume that fii satisfies PI(^>j) with g^^ = 0{e). 
Then the following estimate holds 

(E,,(/) -E^,(/))2 < 0{elu;He)) J iV/pd/., (3.8) 

where the function uj{£) : 11+ — t- 11+ in (3.7) and (3.8) is smooth and monotone satisfying 

1 

w(e) > |lne| 2 for e < 1. 
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3. The mean-difference estimate: proof of Theorem 1.9 



The first step towards the proof of Lemma 3.6 is the following Lemma. 

Lemma 3.7. Consider an arbitrary measure Vi satisfying Vi <^ ^i. Additionally, assume 
that fii satisfies Pl(^i) for some Qi > 0. Then the following estimate holds 



(E,,(/) -E^,(/))2 < ivar^, (0) j \V f\' d^. 



(3.9) 



Proof of Lemma 3. 7. The result is a consequence from the representation of the mean- 
difference as a covariance. Therefore, we note that dvi = d//j since Vi <^ fii and use 
the Cauchy-Schwarz inequality for the covariance 

(E,,(/) - E^,(/))2 = J fdu, - j /d/i, = I f^ df,,- I fdfH I dfii 

^ ' 

Using the fact that /ij satisfies a Poincare inequality results in (3.9). □ 

The above lemma tells us that we only need to construct Vi, which approximates /Xj in 
variance. The following lemma provides exactly this. 

Lemma 3.8 (Approximation in variance). Assume that the measures Vi are given by 
Lemma 3. 6. Then the partition sum Z^. satisfies for e small enough 



Z^^ = (27re)tv/detSi {l + 0{^/e)). (3.10) 
Additionally, Vi is a good approximation in variance of fn, i.e. 

(^^)=0(Vi.»(e)). (3.11) 



var^,. 



Proof of Lemma 3.8. The proof of (3.10) reduces to an estimate of a Gaussian integral 
on the complementary domain IR"\£'j. By recalling, that (27re) 2 -^/det Sj is the normal- 
ization for a Gaussian with covariance matrix S,-, we arrive at 



S . fx — m,- 



Zy^= I e 2e dx = (27re) 2 dei 1 „ / e 2^ dx 



E, \ (27re) 2 det S j Jb."\e, 

The integral on the complementary domain R'^\Ei evaluates by the change of variables 

_ 1 

X I— 5- y = (2eSj) ^ {x — mi) to 



00 



e ' 2e ' dx = j e ^^dy = / r" ^"dr 



(27re) 2 det Sj JR^XEi '^'^ ^{^ + ^)JLu{e) 

r(t) ya;2(e) r(|) 



15 



3. The mean-difference estimate: proof of Theorem 1.9 

where r(^,a;^(e)) is the complementary incomplete Gamma function. It has the asymp- 
totic expansion [01v97, p. 109-112] given by 

r = 0(e^-'(^)u;"-2(e)), for u;ie) > Vn. 

We obtain (3.10) by the choice of w(e) > |lne|2, since the error becomes 

0(e-"''(^)a;"-2(e)) = 0(e|lne|t-i) = 0(Vi), for e < e-"". 

For the proof of (3.11), we compare the asymptotic expression for Z^. = ZiZ^^e^ 
from (2.2) and Z^. and obtain 

Z^^ = Z., + 0(^). (3.12) 

The relative density of Vi w.r.t. /ij can be estimates by a Taylor expansion of H around rn-j. 
By the definition of Vi given in (3.6), we obtain that E^^[y — mj] — Hi{y) = 0{\y — mi\^). 
This observation together with (3.12) leads to 

Now, the conclusion directly follows from the definition of the variance 

= / l + 0(Via;3(e)) d/ii- f / Av\ = \ ^ 0{^e u?{e)) - \. 

□ 

Proof of Lemma 3.6. A combination of Lemma 3.7 and Lemma 3.8 together with the 
assumption Q~'^ = 0(e) immediately reveals 

(3.9), (3.11) 3 f 

(E,,(/)-E^,(/))2 < 0{e-^u:\e)) j \V f\' d^l,. 

□ 

3.3. Affine transport interpolation 

The aim of this subsection is to estimate (Ej^. (/) — Wiy. (/)) ^ with the help of the weighted 
transport distance T^^Ui^Vj) introduced in Section 3.1. The main result of this section 
estimates the weighted transport distance 7^(t'j,^'j) and is formulated in Lemma 3.11. 
For the proof of Lemma 3.11, we construct an explicit transport interpolation between 
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3. The mean-difference estimate: proof of Theorem 1.9 

Vi and Uj w.r.t. the measure /i. We start with a class of possible transport interpolations 
and optimize the weighted transport cost in this class. 

Now, we state the main idea of this optimization procedure. Recall that the measures 
Vi and Uj are truncated Gaussians by the approximation we have done in the previous 
Section 3.2. Hence, the measures Vi and Vj are characterized by their mean and covariance 
matrix. We will choose the transport interpolation (cf. Section 3.3.1) such that the push 
forward measures Vg •= {^s)i^o are again truncated Gaussians. Hence, it is sufficient 
to optimize among all paths 7 connecting the minima rrii and rrij and all covariance 
matrices interpolating between Sj and Sj. 

3.3.1. Definition of regular afFine transport interpolations 

Let us state in this section the class of transport interpolation among we want to optimize 
the weighted transport cost. 

Definition 3.9 (Affine transport interpolations). Assume that the measures Vi and Vj are 
given by Lemma 3.6. More precisely, i/j = Af{mi,£~^'Ei)i_Ei and Uj = J\f{mj,e~^T,j)i_Ej 
are truncated Gaussians centered in m-j and ruj with covariance matrices e~^Sj and 
e~^Tjj. The restriction Ei and Ej are given for / = 1, 2 by the ellipsoid 

Ei = {x£R'' : \T,p{x-mi)\ < V2euj{e)}, where w(e) > |lne|5 . 

A transport interpolation between t'j and i/j is called affine transport interpolation if 
there exists 

• an interpolation path (7s)se[0,T] between mi = 70 and mj = jt satisfying 

7 = (7s).e[o,T] eC2([0,r],R") and G [0, T] : 7, G 5"-\ (3.13) 

• an interpolation path (Es)sg[o,T] of covariance matrices between Sj and Sj satisfy- 
ing 

S = (S,),g[o,T] GC2([0,T],5+(ri)), So = Si and St = S,, 
such that the transport interpolation (^s)se[o,r] is given by 

$,(x) = S|So^(x-mo) + 7s. (3.14) 

Since the cost density A given by (3.3) is invariant under rescaling of time (cf. Re- 
mark 3.4), one can always assume that the interpolation path 7^ is parameterized by 
arc-length. Hence, the condition 7^ G S^~^ (cf. (3.13)) is not restricting. 
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3. The mean-difference estimate: proof of Theorem 1.9 

We want to emphasize that for an afHne transport interpolation (^s)se[o,r] the push 
forward measure ($s)(jZ/o = i^s is again a truncated Gaussian A/'(7s, e~^Ss)Li?<j, where Es 
is the support of being again an eUipsoid in given by 

E, = {x£ R" : - js)\ < V2£ uj{e)}. (3.15) 
Therewith, the partition sum of Vs is given by (cf. (3.10)) 

Z^^ = (2^e)tv^detS,(l + 0{VE)). (3.16) 

1 

By denoting 0"^ = S| and using the definition (3.14) of the affine transport interpola- 
tion ($s)5g[o^T], we arrive at the relations 

6s (x) = &s(Tq^{x - mo) +7s, 
'^s^iy) = o-o(^s^{y - Is) + mo, 

^so^jHv) = <js(jjHy-is)+is- 

Among all possible affine transport interpolations we are considering only those satisfying 
the following regularity assumption. 

Assumption 3.10 (Regular affine transport interpolations). The affine transport inter- 
polation (7s, Ss)sg[o,r] belongs to the class regular affine transport interpolations if the 
length T < T* is bounded by some uniform T* > large enough. Further, for a uniform 
constant c-y > it holds 

inf {r(x, y,z) : x,y,z £ -f,x y^y z x} > c^, (3.17) 

where r{x,y,z) denotes the radius of the unique circle through the three distinct points 
x,y and z. Furthermore, there exists a uniform constant Cs > 1 for which 

C^^ Id < Ss < Cs Id and \\t,\\ < Cs- 

The infimum in condition (3.17) is called global radius of curvature 
(cf. [GMSvdM02]). It ensures that a small neighborhood of size ^ 
around 7 is not self-intersecting, since the infimum can only be attained 
for the following three cases: 

(i) All three points in a minimizing sequence of (3.17) coalesce to a 
point at which the radius of curvature is minimal. 

(ii) Two points coalesce to a single point and the third converges to 
another point, such that the both points are a pair of closest 
approach. 

(iii) Two points coalesce to a single point and the third converges to 
the starting or ending point of 7. 



(3.18) 
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3. The mean-difference estimate: proof of Theorem 1.9 



In the following calculations, there often occurs a multiplicative error of the form 1 + 
0{^/£ uj'^{£)). Therefore let us introduce for convenience the notation "~" meaning "=" 
up to the multiplicative error 1 + 0{^/£ UJ^ie)). The symbols "<" and ">" have the 
analogous meaning. 

Now, we can formulate the key ingredient for the proof of Theorem 1.9, namely the 
estimation of the weighted transport distance 7^(fi, Vj). 

Lemma 3.11. Assume that vi and Uj are given by Lemma 3.6. Then the weighted 
transport distance T^{i'i,i'j) can he estimated as 

T>,.0 = w/(j^'l*."*,-'l|jd»)'d„ 

(27re) 2 \ |A I v2vre j 

where the infimum over ^ s only considers regular affine transport interpolations in 
the sense of Assumption 3.10. 

In particular, if we choose uj{e) > | lne|2^ which is enforced by Lemma 3.6, we get the 
estimate 

r,2(^^., y,) < — ^ ^ J\det{Vm{s,,)\ e-r^ (l + 0{^e u:\e))) . (3.20) 
(27re) 2 |A I V 

Remark 3.12. With a little bit of extra work one could show that the estimate (3.19) 
in fact is optimal in the class of regular affine transport interpolations in the sense 
of Assumption 3.10 up to a multiplicative error of order 1 + 0{y/e oj'^{e)). However, we 
omitted the argument because it would not provide any further insights for the main goal 
of this article, namely a new proof of the Eyring-Kramer formula (1.9). 

Because the proof of Lemma 3.11 represents the core of our new proof of the Eyring- 
Kramer formula (1.9), we state the argument in full detail. It consits of three steps 
carried out in the following sections: 

• In Section 3.3.2, we carry out some preparatory work: We introduce tube coordi- 
nates on the support of the transport cost A given by (3.3) (cf. Lemma 3.13), we 
deduce a pointwise estimate on the transport cost A, and we give a rough a priori 
estimate on the transport cost A. 

• In Section 3.3.3, we split the transport cost into a transport cost around the sad- 
dle and the complement. We also estimate the transport cost of the complement 
yielding the second summand in the desired estimate (3.19). 

• In Section 3.3.4, we finally deduce a sharp estimate of the transport cost around 
the saddle yielding the first summand in the desired estimate (3.19). 
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3. The mean-difference estimate: proof of Theorem 1.9 



3.3.2. Preparations for the proof of Lemma 3.11 

The main reason for making the regularity Assumption 3.10 on afHne transport inter- 
polations is that then one can introduce tube coordinates around the path 7. In these 
coordinates, the calculation of the cost density A given by (3.3) becomes a lot handier. 

We start with defining the caps Eq and as 

Eq := {xe Eq: {x- 70, 70) < 0} and E+ := {x e Et ■ {x - 7t, 7t) > 0}, 

The caps E^ and E^ have no contribution to the total cost but unfortunately need some 
special treatment. Further, we define the slices Vg with s G [0, T] 

— - 

Vs = {x e span {7s} : |Ss ^x\ < V2e uj{e)} 

In span 14 we can choose a basis eg,...,e" smoothly depending on the parameter s. 
Especially there exists a family of rotational matrices {Qs)s&[o,T] ^ C^ii^^T], SO{n)) 
satisfying the same regularity assumption as the family {'^t)t&[o,T] such that 

Qse^=7s, Qse' = el, for i = 2, . . . , n, (3.21) 
where (e^, . . . , e") is the canonical basis of R*^. 
Let use now define the tube E as 

E= U hs + Vs). 

The support of the cost density A given by (3.3) is now given by 

supp.A= IJ Es = E(^UEUE+. (3.22) 

se[o,T] 

By the definition (3.15) of Eg and the uniform bound (3.18) on it holds 



diamK < 2^/2eC^ uj{e). (3.23) 

Therewith, we find 

supple B2^2^c.(£)((7T)re[o,T]) — {x G : \x--fr\ < 2v/2eCsa;(e)}. 

The assumption (3.15) ensures that B2^ 2eC^ uj{e)i^'^T)T&[o,T]) is not self-intersecting for 
any e small enough. The next lemma just states that by changing to tube coordinates 
in E one can asymptotically neglect the Jacobian determinant det J. 

Lemma 3.13 (Change of coordinates). For the change of coordinates (r, z) ^ x = ^r+Zr 
with Zr €z Vr it holds for any function ^ on E that 



/ ^{x) dx ^ / ^(7r + Zr) dzr dr. 
Je Jo JVt 
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Proof of Lemma 3.13. We use the representation of the tube coordinates via (3.21). 
Therewith it holds that x = + QtZ, where z G {0} x R"~^. Then, the Jacobian J of 
the coordinate change x i— )• (r, Qtz) is given by 

J=iir + QrZ, {Qr)2, {Qr)n) G R"''", 

where {QT)i denotes the i-th column of Qr- By the definition (3.21) of Qr follows 
'j.j. = {Qt-)i. Hence, we have the representation J = + QtZ (8) ei. The determinant of 
J is then given by 

det J = det (Qr + QrZ <^ ei) = det{Qr) det fid +iQjQrz) ei) = 1 + (qJQtZ 



By Assumption 3.10 it holds HQtII ^ Cs) from which we conclude {Qj Qtz)i,i = 0{z). 
Since QrZ G Vr, we get 0{z) = 0{^/e ijj{£)) by (3.23). Hence we get 



which concludes the proof. 



det J = 1 + 0(Ve u{e)), 



□ 




Figure 1: The support of A in tube coordinates. 
An important tool is the following auxiliary estimate. 

Lemma 3.14 (Pointwise estimate of the cost-density A). For x £ supp^ we define 



T = arg min | x — 7^ | and 

se[o,T] 



x-jr 



Then the following estimate holds 

A{x) < (27re)-V^deti,i(Q7S^iQ,) 
where Qt is defined in (3.21) and is given by 



(3.24) 



(3.25) 



(3.26) 



Further, deti^i A is the determinant of the matrix obtained from A removing the first row 
and column. 
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Remark 3.15. With a little bit of additionally work, one could show that (3.25) holds 
with "~" instead of "<". It follows from (3.26) that the matrix T,~^ is positive definite. 
Hence, A is an ]R"~^ -dimensional Gaussian on the slice 'Jt + Vr up to approximation 
errors. 



Proof of Lemma 3.24- We start the proof with some preliminary remarks and results. 
By the regularity Assumption 3.10 on the transport interpolation, we find that for all 
X £ supp^ holds uniformly 

Ixix) := {s : Eg 3 x} satisfies Ti^ilrix)) = 0{ sup diam(£'s)) = 0{^/e w(e)). 

se[o,T] 

This allows to linearize the transport interpolation around r given in (3.24). It holds for 
s such that x £ Eg 

^J'[x - 7,] = S;1[7t + zr- 7s] + Oielu;\e)) 

For similar reasons, we can linearize the determinant det and have det = det S,- + 
0(-\/e uj{e)). Finally, we have the following bound on the transport speed 

\^sO^JHx)\1eSx) = \asa~\x-js) + %\lESx) 

< {\&sa;\x-js)\ + \is\)lEM (3.28) 

< {Cj^\x-Js\ + 1)1eM = {l + 0{^/euj{e)))lEAx)- 

Let us first consider the case x £ E. We use (3.16), (3.27) and (3.28) to arrive with 
X = jr + Zj- where Zr £ Vr at 

A{x) = [ 16, o <D;1(x)| ^ exp (-^^;'[x - js]) IeA^) 

1 /■ l + 0(Vew(e)) / 1 ^ ir A , 
(27re)2 VdetS, V / 



^ (2vre) 



A== / expf--^S;i[(r-s)7. + ^.])ds(l + 0(Vic^=^(£))) 
2 Vdet St Jb, V / 



\/det S"^ yfl^e 



:exp(-ls;iK]) (l + 0(V^u;3(,)) 



where the last step follows by an application of a partial Gaussian integration (cf. 
Lemma B.l). Finally, by using the relation (B.2), we get that 

deti,i(QTs;iQ,), 



and conclude the hypothesis for this case. 

Let us now consider the case x £ Eq U E^. For convenience, we only consider the 
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case X € Eq . By the definition of Eq it holds r = 0. The integration domain It{x) is 
now given by 

It{x) = [Q,s*) with s* = 0{y/eoj{e)). (3.29) 

Therewitli, we can estimate A{x) in the same way as in the case oi x & E and conclude 
the proof. □ 

We only need one more ingredient for the proof of Lemma 3.11. It is a a priori estimate 
on the cost density A. 

Lemma 3.16. The following estimates for A hold: 

j A{x) dx < T, and (3.30) 



n-l 
2 



■^^""^ ^ (S) for xesnpp A. (3.31) 

Proof of Lemma 3.16. Let us first consider the estimate (3.30). It follows from the char- 
acterization (3.22) of the support of A that 

/ A{x) dx= A{x) dx+ A{x) dx. (3.32) 

J Je Je-ue+ 

Now, we estimate the first term on the right hand side of the last identity. Using the 
change to tube coordinates of Lemma 3.13 and noting that the upper bound (3.25) is a 
Gaussian density on Vj- for r G [0, T], we can easily infer that 



/ Aix) dx <h\=T 
Je 



Let us turn to the second term on the right hand side of (3.32). For convenience, we only 
consider the integral w.r.t. the cap E^ . It follows from (3.28) and (3.29) that 

A{x) dx < / / i>s{x) ds dx = / / Usix) ds dx 







s ■ 



< y us{x) ds dx = s* = 0(Vi uj{e)), 
which yields the desired statement (3.30). 

Let us now consider the estimate 3.31. Note by Remark 3.15 the matrix given 
by (3.26) is positive definite and the matrix we subtract is also positive definite. There- 
fore, it holds in the sense of quadratic forms 

< = - — < 
Now, the uniform bound (3.18) yields 



'deti^iiQjtr^Qr) < Cj.^ . 
Then, the desired statement (3.31) follows directly from the estimate (3.25). □ 
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3.3.3. Splitting of transport cost and reduction to neighborhood around the saddle 

Firstly, observe that from (3.31) follows the a priori estimate 

Hence, on an exponential scale, the leading order contribution to the cost comes from 
neighborhoods of points where H{x) is large. Therefore we want to make the set, where 
H is comparable to its value at the optimal connecting saddle Sij, as small as possible. 
For this purpose, let us define the following set 

S-^,s := {x G supp^ : H{x) > H{sij) - eu}^{e)} . (3.34) 

Therewith, we obtain by denoting the complement ^ •= supp^\H^^E the splitting 

The integral on can be estimated with the a priori estimate (3.33) and Lemma 3.16 
as follows 

J\^(x) (3.34) Hjsj.j) 2, ^ r 



ll ,,2 



n — 1 

(3.31) H(s.,,) ,^ ^ f C^\ — 



< Z^e^--'(^) ' I A{x) dx 



n-l 



(3.30) H{s,.,) 2/ N / \ ^ 



\2Tre J 



Notice that the last estimate yields the second summand in the desired estimate (3.19). 



3.3.4. Cost estimate around the saddle 



The aim of this subsection is to deduce the estimate 



-,,E l^(^) (27re)2 I A- 1 

Note that this estimate would yield the missing ingredient for the verification of the 
desired estimate (3.19). 



By the non-degeneracy Assumption 1.3, we can assume that e is small enough such that 
Eq UE^ C HJ^ 2- It follows that H^^s C E. We claim that the transport interpolation 
can be chosen such that there exists a connected interval It C [0, T] satisfying 



,s C \J{Vs + 7.) and H^It) = 0(Vi oo{e)). (3.36) 
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Indeed, the level set {x G K," : H{x) < H{sij) - eio^{e)} 
consists of at least two connected components Mj and Mj 
such that mi £ Mi and mj G Mj. Further, it holds 

distfMj, M,-) = inf \x - y\ = 0(^/e oj(e)), 

xeMi,yeMj 

which follows from expanding H around Sij in direction of 
the eigenvector corresponding to the negative eigenvalue of 
\7'^H{sij). We can choose the path 7 in direction of this 
eigenvector in a neighborhood of size 0{^/e w(e)) around 
Si J, which shows (3.36). 

Combining the covering (3.36) and Lemma 3.13 yields the estimate 

Recalling the definition (3.21) of the family of rotations (<5r)Te[o,r]i it holds that Zr = QtZ 
with z G {0} X IR,"^^. Hence, the following relation holds 

/ / ^%±^d..dr=/ / lvAQrz) '^y^^i drdz. (3.38) 

The next step is to rewrite H{'^t-+Qtz)- By the reason that \zr\ = 0{y/£ a;(e)) for z-r 
and the global non-degeneracy assumption (1.5), we can Taylor expand H{'jT-+Zr) around 
Sij = 7t-* for T £ It and Zr = QtZ G Vr- More precisely, we get 

H{-fr + QrZ) - H{Sij) = ^V^H{Sij)[jr + QrZ - Sij] + 0{\jr + QrZ - Sijf) 

= ^V'His,,,)[^r - Ir'] + lv'His,,,)[QrZ] 

+ {QrZ, V^H{Sij){-fr - 7r*)) + 0(|7r + QrZ " 7r- |^) 

Now, further expanding jr and Qr in r leads to 

7^ = 7^. +7^,(r-r*) + 0(|r-T*|), and QrZ = Qr*z + 0{\t - t*\\z\). 
For the expansion of H, we arrive at the identity 

H{-ir + QrZ)-H{Sij) = 

\v^H{s,,)[^r'{T - T*) + 0(|r - T*p)] + ^V'H{s,,)[Qr*z + 0(|r - t*\ \z\)] 
+ {Qr*z + 0(|r - T*| \z\),V^H{sij) (r - t*) + 0(|r - t*\^)) ) 

+ 0(l7r + Qr2-7r*|^) 

= \v^H{si,j)[^r*] (r - r*)2 + ]^V^H{si^j)[Qr*z] + {Qr*z,V^H{si,j)jr*) (r - r*) 
+ 0{\t - r*|^ \z\ \t - r*|^ jzj^ |r - t*\ , |z|=^). 
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Using |r — r*| = 0{y/£ a;(e)) and \z\ = 0{y/e (^(e)) we obtain for the error the order 
estimate 

0(|r - T*|^ \z\ \t -t*\\ |r - t*\ , \zf) = 0{el u^{e)). 

The term (^Qr*z, \7'^H{sij)'yr*) {t — t*) in the expansion of H has no sign and therefore 
has to vanish. This is only the case, if we choose 'Jt* as an eigenvector of V'^H{sij) to 
the negative eigenvalue A~, because then 

{Qr*Z,V^H{Si,,)^r') {t - T*) = X- {Q,, Z, j^*) = 0. 

Additionally, by this choice of 7,-* the quadratic form \7'^H{sij)['yr*] evaluates to 

V^H{Si,j)[jr*] = X'\jr*\^ = X-. 

Therefore we deduced the desired rewriting of i^(7T + Qtz) as 

H{-/r + Qrz) = H{s^j) - |A-|(r - T*f + ^V^H{si,j)[Qr'z] + 0(ei co^e)). 
From the regularity assumptions on the transport interpolation we can deduce that 

t;^[QrZ] = t;}[QrZ] + 0{\t - T*\ \zf) 

= t;}[Qr*z + 0{\r - T*\ \z\)] + 0(|r - r*| \z\^) 
= t;}[Qr*z] + 0{eluj^{e)). 

Then, it follows easily from the definition (3.25) of Pr that 

Pr^Pr". (3.39) 

Applying the cost estimate (3.25) of Lemma 3.14, the rewriting 3.3.4 of H{^t- + Qrz), 
and the identity (3.39) yields the estimate 

+ < Z,e^P?,e- '"'''""-'"""'-' -^^. (3.40) 

The exponentials are densities of two Gaussian, if we put an additional constraint on the 
transport interpolation. Namely, we postulate 

2t:;} - V^H{sij) > on spanF^. 

in the sense of quadratic forms. Note that spanV^* = Qt*{{0} x IR,"^^) = span {7,-* j"*" 
is the tangent space of the stable manifold in the 1-saddle Sj.j. With this preliminary 
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considerations we finally are able to estimate the right hand side of (3.38) as 

J{0}xR"-i Jit Kir + QrZ) 

< / / Pi e 2i dr dz 



{OjxK, 



-\/|A I 7(0} xR' 



< Zf, e—T^ '^-Z- / P"^* e ' 2^-= dz 



3 n— 1 



(3.41) 



deti,i gJ.(2S;*i - V2/7(s,j))Q,. 



Zu 27re deti,i(g;.S;,^Q, 

g £ ^^^^^^ 

(2vr£)t ^ ./deti,i (gT(2S-i-V2/f(s,,.))Q. 



to optimize! 

The final step consists of optimizing the choice of St-*. Let us use the notation A = 
QI,Ti~}Qt-* and B = Qj,H{sij)QT-* . Then the minimization problem has the struc- 
ture 

inf J deti,i A ^ ■, 2A - B > on {0} x R""^ I . (3.42) 
Aes+(n) [ y/deti,! {2A - B) J 

In the appendix, we show in Lemma B.2 that the optimal value of (3.42) is attained at 
= V'^H{sij) restricted Vr*- The optimal value is given by 

deti 1 A 



Because Vr* is the tangent space of the stable manifold of the saddle Sij it holds 

detu(Qj.V^fl(,.,)eJ.) = ^^(Z;^^ = ^i(V!M. ,3,43) 

A I I 

The final step is a combination of (3.37), (3.38), (3.41) and (3.43) to obtain the desired 
estimate (3.35). 

For the verification of Lemma 3.11, it is only left to deduce the estimate (3.20). For that 
purpose we analyze the error terms in the estimate (3.19) i.e. 



"^^^^'^^■^-(2^^ ' H \X^\ ^^.e 



n-l N 



=o(i) 



=0(£-2e-"^{=)) 
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By our choice of u}{e) > |lne| 2 , enforced by Lemma 3.6, we see that 

0(£-le-'^'(=)) = 0(Vi). 



RecaUing, that "<" means "<" up to a multiphcative error of order 1 + 0{^/e oj^{e)) we 
get 

T>u^,) < tAt »^ 2..^^^^^^ (1 +0(./J (1 + 0(V?)) . 

The last inequahty aheady yields the desired estimate (3.20) by using the observation 

(1 + 0(Vi 0.3(e))) (1 + 0(Vi)) = (1 + 0(Vi u^ie))) . 

Remark 3.17. Let us summarize the additional constraints on the transport interpolation 
besides the Assumption 3.10 of a regular affine transport interpolation to obtain the 
desired estimate (3.19): 

• 7 passes the saddle point Sij at the passage time r* in direction of the eigenvector 
to the negative eigenvalue A~ of V'^H{sij) 

• 7 stays in the sublevel set {H{x) < H{sij) — ea;^(e)} up to a small time interval 
of order ^/e u;{e) around the passage time r* 

• It holds Ti^} = \7'^H{sij) on the stable manifold of Sij. 



3.4. Conclusion of the proof of Theorem 1.9 

With the help of Lemma 3.6 and Lemma 3.11 the proof of Theorem 1.9 is straightforward. 
We can estimate the mean-differences w.r.t. to the measure //j by introducing the means 
w.r.t. the approximations and uj. 

(E^,(/) - E,,^,(/))' = (E^^(/) - E,,(/) + E,,(/) - E,^,(/) + E,^,(/) - E^^,(/))' 

We apply the Young inequality with a weight that is motivated by the final total multi- 
plicative error term -R(e) in Theorem 1.9. More precisely, 

(E^,(/) - E^^,(/))' < {l + e'^u'{e)) (E,^(/) - E,^(/))' 

+ 2(1 + e-5^-3(e)) ((E^,(/) - E,,(/))2 + (E^^ (/) - E,^ (/))') . 

Then, the estimate (3.8) of Lemma 3.6 yields 

(E^,(/)-E^^,(/))'<(l + Viu;3(e))(E,^(/)-E,^(/))' + 0(e) J iV/^d/., (3.44) 
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which justifies the statement, that the approximation only leads to higher-order error 
terms in e. An application of (3.1) to the estimate (3.44) transfers the mean-difference 
to the Dirichlet form with the help of the weighted transport distance 

(E^,(/)-E,,^,(/))'< ((1 + V^a;3(e)) 7;2(i.„ z.,) + 0(e)) ^ [V/I^d/., 

Since the weighted transport distance T^ii'i, Vj) is dominating and we can assume w.l.o.g. 
that 'T^{vi,Vj) > 1, we arrive at the estimate 

Now, the Theorem 1.9 follows directly from an application of the estimate (3.20) of 
Lemma 3.11 and setting w(e) = | Ineja. 

4. The local Poincare inequality: proof of Theorem 1.7 

In this section, we are considering only one of the basins of attraction and therefore 
we omit the index i. We will write 17 and /i instead of fij and respectively. Further, 
we assume w.l.o.g. that G is the unique minimum in fi. 

We start with some heuristics for the validity of Theorem 1.7. We consider the Gibbs 
measure /i restricted to a basin of attraction Q given by 

We have to show that this restricted Gibbs measure fi satisfies the PI(g) with constant 
g'^^ = 0(e) (cf. Lemma (1-5)). We argue as follows: On a basin of attraction Q. can only 
be one local minimum of H, which is located w.l.o.g. at 0. Therefore, on cannot exist 
another metastable state of the diffusion given by (1.3). This means that for small 
noise e the diffusion slides down on the energy landscape without any obstacle until 
it reaches a small region around the local minimum of H at 0. This heuristically implies 
that only the small region around is important for the Poincare constant of^. Therefore, 
the Gibbs measure /i restricted to the basin of attraction and the Gibbs measure 
restricted to a small neighborhood of should have the same Poincare constant in terms 
of scaling in e. However, around the local minimum the Hamiltonian H is strictly 
convex due to our non-degeneracy assumption (1.5). Now, an application of the criterion 
of Bakry-Emery [BE85] - it connects convexity of ^ to the Poincare constant of /i - 
yields that the Gibbs measure /x restricted to a small region around should satisfy the 
Pl{g) with constant = 0{e). 

Now, let us turn to the main idea of our proof of Theorem 1.7. First of all, we want to 
note that the standard criteria for the Poincare inequality do not cover our situation. 
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• The criterion of Bakry-Emery [BE85] cannot be applied because H is not convex 
on the basin of attraction J7. 

• The perturbation lemma of HoUey-Stroock [HS87] cannot be applied naively be- 
cause it would yield an exponentially bad dependence of the Poincare constant q 
on e. 

Staying close to the heuristics from above, the Lyapunov condition would be the most 
natural approach to Theorem 1.7 (cf. Section 4.1 or [BBCG08]). However, one would 
have to construct an explicit Lyapunov function (cf. Definition 4.4) with rather subtle 
properties. Especially around critical points i.e. V-ff(x) = 0, this seems to be very 
difficult. 

For that reason we choose a different approach. The motivation for our argument comes 
from the one-dimensional case i.e. : IR, — t- R. In this situation. Theorem 1.7 can 
be deduced by using the Muckenhoupt functional (cf. Section 4.3 and [Muc72]), which 
determines the Poincare constant in one dimension up to a universal factor. Because of 
the non-existence of a multidimensional version of the Muckenhoupt functional, we have 
to reduce the multi-dimensional case to the one-dimensional case. 

For this purpose, we introduce on polar like coordinates ipr/ii") G ^ for r] G S'"'~^ and 
r > (cf. Section 4.2). The coordinates ipnif) are going to be a small perturbation of 
the coordinates resulting from the deterministic gradient-flow 

parameterized by arc-length i.e. |V'?y('^)| = 1- In these coordinates, the restricted Gibbs 
measures becomes 

^^{dr,dv) = ^""^^"^^ j,it) exp (^-^i?(V;,(r))) dr dr,, (4.1) 

where j-q{r) is the Jacobian determinant of the coordinate transformation. 

Now, we carry out a two-scale argument similar to the one used for the proof of the 
Eyring-Kramers formula (see Corollary 1.10 and Section 2). So, the restricted Gibbs 
measure /i(dr, dry) is decomposed into 

/i(dr, dr/) = n{dr\r]) fl{dr]), (4.2) 

where the conditional measures fj,{dr\r]) and the marginal fi{dr]) are given by 

H(dr\rj) = ^e-^^^'^^W^j-Jr) dr (4.3) 

f}{dr]) = — / e-i^(^(''))i^(r) dr dr/. (4.4) 
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Therewith, we get the same sphtting as m the discrete case of Lemma 1.6, i.e. 
var^(dx)(/(x)) = var^(dr_d,7)(/(V'r,(r))) 

= %(d,7) (var^(dr|r,)(/(V'»?('')))) + var^(d,,) (E^(dr|r,)(/(V'r?(?-)))) • (4.5) 

Let us consider the first term on the right hand side of the last equation. Note that 
the conditional measures //(drjry) are one-dimensional. Hence, we are able to deduce 
Poincare inequahties for the conditional measures //(dr|r/) using the Muckenhoupt func- 
tional. Of course, this step is very sensitive to the choice of the coordinates 
However, a careful and non-trivial construction of the coordinates yields the desired 
scahng of the Poincare constant. The next statement contains the existence of good 
coordinates {V'r^l^gs' 

Proposition 4.1. Assume that the basin of attraction is bounded. Then, there exist 
coordinates {V'rjj^g^Ti-i on such that the conditional measures n{dr\r]) are radial-mixing 
measures in the sense of Definition 4-M- 



The definition of radial-mixing measures is technically and for the moment means that 
the Hamiltonian evaluated along coordinates lines ff('0^(r)) fulfills certain monotonicity 
assumptions and the Jacobian determinant jn{r) of the coordinate transform shows a 
controlled blowup behavior, not causing any metastabilites. Because the proof of Propo- 
sition 4.1 is relatively elementary but a bit lengthy, it is carried out in full detail in the 
Appendix A. We want to point out that Proposition 4.1 demands the boundedness of the 
domain Q. However, this does not present a problem, because in Section 4.1 we will show 
by a Lyapunov argument that we may indeed assume that Q is bounded (cf. Corollary 4.7 
below) . 

The next statement shows that radial-mixing measures are satisfying the Poincare in- 
equality PI{q) with the right-scaling Poincare constant g. 

Proposition 4.2. Let lJ,{-\ri) be a radial-mixing measure in the sense of Definition 4-M- 
Then the measure /x(-|?7) satisfies PI(^)(r/)) with constant 

— L = 0(e) for 

q{v) 

uniformly in rj £ S'^~^. 



The proof of Proposition 4.2 is carried out in the Section 4.3. It is based on some 
properties of radial- mixing measures that are outlined in Section 4.2. 

Now, let us consider the second term on the right hand side of (4.5), which is given by 

var^(d,)(/(r/)) = J (/(t?) - J f{e) /i(d0)) fi{dr,) (4.6) 
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by using the notation 

fiv) = lE^(dr|r,)(/(V'r,(r))). 

Unfortunately, an application of the same strategy as in [GOVW09] would not yield the 
desired estimate. The reason is that, following [GOVW09], one would apply a Poincare 
inequality for the marginal measure fi{drj) and then estimate the coarse-grained gradient 
by the full gradient via a covariance estimate. However, the Poincare constant for the 
marginal measure fi already is at least of order 1, since fi lives on the sphere There- 
fore, this strategy could not yield the desired Poincare inequality PI(^) with constant of 
order g^^ = 0{e). 

For that reason we use a different strategy. Following the proof of Corollary 1.10, we 
represent the polar variance given by (4.6) as mean-difference. More precisely, the polar 
variance takes the form 

var^(d,)(/(7?)) = 111 {fiv) - m)\{dv) /i(d^). (4.7) 

From this starting point, the procedure is somehow similar to the proof of the mean- 
difference estimate of Theorem 1.9. However, the procedure needs some more evolved 
ingredients due to the fact that the marginal measure fi lives on the continuous state space 
S"'~^ and the conditional measures fi(dr\r]) have one-dimensional support. The argument 
is outlined in detail in Section 4.4, in which the following statement is deduced. 

Proposition 4.3 (Polar mean-difference estimate). It holds the estimate 

{fiv) - />))'/i(dr/) Kde) < 0(e) J \Vffdfi. 

Now, we have provided all the ingredients that are needed for the proof of Theorem 1.7. 

Proof of Theorem 1.7. The starting point of the proof is formula (4.5). By Corollary 4.7 
from below we may assume w.l.o.g. that is bounded. Therefore, Proposition 4.1 allows 
us to apply the local Poincare inequality of Proposition 4.2, which leads to the estimate 

var^(dx)(/(2;)) < 0(e) j j |5r/(V'>7(?'))lV(dr|??) A(dr?) -h var^(d„) (/(??)) 

= 0{e) j j |V/(^,(r))|2|^,(r)|V(dr,dr?)+var^(d,)(/(r?)). 

=1 

We now need to consider the second term on the right hand side of the last inequality. 
The polar variance var^(d»?)(/(^)) is expressed by (4.7) as a mean-difference. Hence, an 
application of the estimate of Proposition 4.3 yields the desired statement. □ 

It is only left to verify the ingredient used in the proof of Theorem 1.7. In the Section 4.1, 
we deduce Corollary 4.7. In Section 4.2, we introduce the notion of radial-mixing mea- 
sures and state some important properties of them. In the Appendix A, we carry out 
the proof of Proposition 4.1. In Section 4.3, we will prove Proposition 4.2. Finally, in 
Section 4.4, we state the proof of Proposition 4.3. 
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4.1. Reduction to bounded domains 

In this subsection, we will show that by condition (Al) and (A2) the Poincare constant 
of [I is dominated by the Poincare constant of a restricted measure /i^ (cf. Corollary 4.7), 
which is given for some i? > by the density 

. lonBfl(o)(a^) . 
Aiii(dx) = /x(dx). 

For this purpose, we apply a standard technique in the context of Poincare inequalities: 
the Lyapunov condition. 

Definition 4.4 (Lyapunov condition). Let : — J- IR be a smooth Hamiltonian and 
let 

denote the associated Gibbs measure \x. Then, : J7 — t- [1,cxd) is a Lyapunov function 
for H provided there exist constants i? > 0, 6 > 0, and A > such that 

1 LVF< -AH^ + 61n^. (4.8) 

For a good paper about the use of the Lyapunov condition for deducing Poincare inequal- 
ities we recommend [BBCG08]. The main ingredient of this technique is the following 
statement. 

Theorem 4.5 (Theorem 1.4. of [BBCG08]). Suppose that H fulfils the Lyapunov con- 
dition (4.8) and that /x/j satisfies Fl(gfi). Then also /i satisfies PI(f?) with constant 

Q > 



Even if the proof of Theorem 4.5 is simple, we refer the reader to [BBCG08]. We want to 
apply Theorem 4.5 to our situation. Hence, we do not only have to verify the Lyapunov 
condition (4.8) but also have to investigate the dependence of the constants i?, 6, and A 
on the parameter e. This is carried out in the next lemma. 

Lemma 4.6. Assume that H satisfies the assumption (Al) and (A2) with constants 
Ch > o-nd Kh > 0. Additionally, assume that 

1 cl 



2CI + ^Kh 

Then H satisfies the Lyapunov condition (4.8) for some e-independent constants R and 

8e ■ 



(j2 

b and an e-dependent constant A = -5^ 
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Proof. We take W = as Lyapunov function and observe that 

= AH + \VHf - e-'^ \VHf = - (^(e^^ - 2) \VH\^ + {\VHf - AH)^ . (4.9) 
Now, by the assumptions (Al) and (A2) we find R < oo such that 

y\x\>R: \VH{x)\ > \Ch and \VH{x)\^ - AH{x) > -2Kh. 
Therewith, we estimate (4.9) in the case |x| > as 
ILe^ 1 „o 1 „9 „ 1 „9 . . . , 1 



- + -,Cl + 2Kn < --Cl =: -A, provided that . < ^^^^^^ 



-'H 

In the case |x| < R, we get from (4.9) by omitting the term with negative sign 

<AH + iVHf < sup (ah + \VHf) b, 
e \x\<R^ ' 

which completes the argument. □ 

As a simple consequence of Theorem 4.5 and Lemma 4.6, we obtain the main result of 
this subsection, which allows to consider w.l.o.g. bounded domains Vl for the proof of 
Theorem 1.7. 

Corollary 4.7. Assume that H satisfies the assumption (Al) and (A2). Additionally, 
assume that /i/j satisfies PI(f)/{) with constant = 0{e). Then /x also satisfies Pl{g) 

with constant | = 0(e). 



Proof. We can apply Theorem 4.5 and with Lemma 4.6 and find 



Q ~ ^QR Cjj \QR 



< = _^ ( _^ + 1 ) = 8e (0(e) + 1) = 0(e) 



□ 



4.2. The right coordinates and properties of radial-mixing measures 

In this subsection we discuss the right choice of local coordinates {ipri)neS"'^ ™ order to 
apply the two-scale proof of Theorem 1.7 of the last section. Moreover, we introduce the 
notion of radial-mixing measures (cf. Definition 4.14) and state some properties, which 
are needed in the proof of Proposition 4.2 in Section 4.3. 

An obvious question is: Why are we not using the coordinates ipriir) resulting from the 
deterministic gradient-flow 
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parameterized by arc-length i.e. = 1? The Poincare inequahty of [i is strongly 

connected to ergodic properties of the diffusion given by (1.3). Intuitively, these 
coordinates should describe the diffusion very well as we are only interested in the 
regime of vanishing noise e. For this reason, deterministic gradient-flow coordinates iprji^) 
should be a natural choice of coordinates. 



However, we are not using these deterministic gradient-flow coordinates because the 
conditional measures 

-ili(Mr))j^(^r) dr 



fi{dr\r]) 



1 



Z 



given by (4.3) will not show the right scaling of the Poincare constant. A detailed analysis 
reveals that problems only arise in neighborhoods of critical points x, i.e. Vi/(x) = 0. 
Therefore, we will only change the coordinates in neighborhoods around critical points 
(cf. (HI) of Assumption 4.8). Heuristically, this observation seems to be plausible as, 
at critical points, the gradient of H becomes comparable to the noise. Hence, at least 
there, stochastic effects have to be taken into account. The main stochastic effect to be 
considered is that the noise does not have a preferred direction. Thus, it is better to 
smoothly transform the deterministic gradient-flow coordinates around local extrema to 
polar coordinates (cf. (H2) of Assumption 4.8). 

However, a more serious problem arises if one 
considers directions t] G S""^ such that the 
coordinate lines ipn{f) almost hit a saddle 
point of H. For such directions r], the Hamil- 
tonian 

Hr,{r) := ^H{i;r,{r)) - In j^(r) (4.10) 



of the measure /x(dr|r/) has a metastabil- 
ity in the case of deterministic gradient- 
flow coordinates ipr]if), since Jrj(r) becomes 
unbounded to -|-oo. Here, the following 
stochastic effect has to be taken into account: 
If one follows a trajectory of the diffusion 
given by (1.3) starting anywhere above the 
saddle, one will almost never get too close to 
the saddle point. The reason is that the pro- 
cess S^t will be pushed away from the saddle 
as soon as the noise becomes comparable to 
|V-ff(^t)|. This stands in contrast to deter- 
ministic gradient-flow coordinates, where you can get as close to the saddle as you want. 
Heuristically, this shows that deterministic gradient-flow coordinates are not appropriate 
at the saddle for the stochastic dynamics. 

As you can see by Figure 2, we adjust the deterministic gradient-flow coordinates around 
the saddles in such a way that you will never get close to the saddle by following a 




Figure 2: Deterministic gradient-flow co- 
ordinates obtained from H 
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coordinate line from above the saddle. The origin of this idea goes back to a discus- 
sion of the authors with Martin Hairer. Technically, these ideas are manifested in the 
properties (H3), (H4), and (H5). Later, these properties guarantee in the proof of 
Proposition 4.2 that in the Hamiltonian -ff-^(r) there are no metastabilites for any direc- 
tion T] (cf. Section 4.3). 

As we have outlined above, the desired coordinates ipr/ii^) will be slight modifications of 
the coordinates resulting from the deterministic gradient-flow w.r.t. the Hamiltonian H. 
An easy way to construct such coordinates is to consider a small perturbation H of the 
Hamiltonian H. Then, we define the modified coordinates ipn{r) by the deterministic 
gradient-flow w.r.t. the perturbation H. In the next assumption we state some proper- 
ties of the perturbation H that guarantee that the resulting coordinates will serve our 
needs. 

Assumption 4.8 (Perturbation H of H). We assume that the domain ^1 is bounded. 
Additionally, we assume that the perturbation H £ C'^(Q,\{0}) Pi C^{(}) of the Hamilto- 
nian H satisfies the conditions: 

(i) H equals H up to small 5 -neighborhoods around the critical points of H, i.e. by 
recalling that S is the set of critical points of H we have 

Vx ^ U Bsiy) : H{x) = H{x). (HI) 
yes 

(a) H restricted to a ball around the local minima located in of radius | > is 
spherically symmetric and linear, i.e. 

yx € Bs{0) : H(x) = H{0) + \x\. (H2) 

2 

(Hi) \VH\ is uniformly bounded from below in Q\{0}, i.e. 

> Vx G 0\ {0} : \VH{x)\ > c^. (H3) 

By using the Assumption 4.8, we are able to define the coordinates ipr){T) as follows. 

Definition 4.9 (Local coordinates ip-riir)). Assume that H satisfies the Assumption 4.8. 
For r G R, we consider the associated flow ^j. : i}\ {0} — >• r2\ {0} defined as the solution 
of 

^o(^) = X and = FiMx)) := .Zfi^il!, - (4-11) 

For r] G S^~^, the coordinates tpn ■ (0,7^^) — )• IR" are defined by 

Mr) = ^s{lv), (4.12) 



where the domain (0, T^) is chosen to be maximal. It easily follows from the Assump- 

Tr,<Cfj for all r/ G 5"-^ (4.13) 



tion 4.8 that there is a constant Cjj < oo such that 
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Remark 4.10. Note that the coordinates ■0^ are well-defined by the property (H2). 
Namely, the coordinates iprj{i^) satisfy for r € (0, |) the equation 

\VH{iP^{r))\ 



Therefore ^ ^(j^) = fo^' ^ (0, j)- This shows that il'vi''') polar coordinates 

'""2 

around the local minima. 



As we have explained above, it is very important to have a good understanding of the 
Hamiltonian 

of the conditional measure fi{dr\r]). In the next statement, we consider the evolution of 
the second term of the last identity, namely the Jacobian determinant jn{f)- 

Lemma 4.11 (Evolution of 3r^{r)). For x G {0} let D"^r{x) be the Jacobian of the 
flow given by (4.11). Then the Jacobian determinant 

j^{r) := detDvI/^_|(|r/) 
evaluated along the gradient-flow lines 4^-q{r) (cf. (4.12)J satisfies the evolution 

VrG(0,r^): ^\uj^{r)=\I-F{^r,{r)). (4.15) 
Proof. Differentiating the identity (4.11) leads to 

Dilrix) = DF{^r{x)D^r{x). 

As D'^r is regular, we can multiply by {D'^r)'^{x) and take the trace 

tr (Di!r{x){D-^rY^{x)] = iiDF{^r{x)) = V • F{^r{x)). 



By Jacobi's formula (B.3) the left hand side of the last identity can be written as 

d 



^^\udeiD^!r{x) = tr (p^ir{x)D^!-^{x)^ , 
which yields the desired formula (4.15). □ 

With the help of the last statement, we are able to formulate a condition (cf. (H4) 
below) on the perturbed Hamiltonian H. The condition (H4) ensures that the Jacobian 
determinant Jnif) behaves in the right way when applying the Muckenhoupt functional 
later (cf. Proposition 4.2). 
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Lemma 4.12 (Bounds on j^(r)). Assume that the perturbed Hamiltonian H satisfies 
the Assumption 4-8 and that the local coordinates iprji''') dfs given by Definition 4-9. 
Additionally, assume that H satisfies the condition for r > 5/2 

-C^max{l,(r^-r)-2} < V-F(V'„(r)) <C^, (H4) 

for some constant Cjj > 1. Then the Jacobian determinant jnir) satisfies 

Vre(0,|]: j,(r)=r"-i, (jl) 
VrG(|,f,): Cj,mm{l,exp{-Cj,{T^-r)-')}<j^{r)<C^ (j2) 

for some constants < Cjj < Cjj < oo, uniformly in r/. 



Proof of Lemma 4- i 2- The property (jl) is a direct consequence of the fact that the 
coordinates tprji^) are spherical coordinates for r < | by Remark 4.10. 
Let us consider the property (j2). By the fundamental theorem of calculus and the 
identity (4.15) we obtain for r G (|,T^) 

In j^(r) - In j,(|) = T V • F(^^(s))ds. 
Taking the exponential in the last identity leaves us with 

M^) = J^(f)exp ( V • F{i;^{s))ds] . 



Employing the uniform upper bound on < Cj^ (cf. (4.13)) and the upper bound 

of (H4) immediately yields the desired upper bound on jn{r). 

Let us now consider the lower bound of (j2). Straightforward calculation yields 

exp (^/| V • FiMs))^^^ > exp ^-C^ max{l, (T^ - s)-^} dsj 

/ /.max{r,r^-l} \ 

>e-^H''expl-Cfj J (r^-s)-MsJ 
= e-^^r exp {-Cfj{T^ - max{r^ - 1, r})~^) . 

Employing again the uniform upper bound on < Cg (cf. (4.13)) yields the desired 
lower bound. □ 



We still need to understand the first term in the Hamiltonian if^ given by (4.10), namely 
H{ipr){f)). The growth estimates (hi) and (h2) are one of the main ingredients to apply 
of the Muckenhoupt functional in the proof of Proposition 4.2 (see Section 4.3). 
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Lemma 4.13 (Properties of Assume that the perturbed Hamiltonian H satisfies 

the Assumption 4-8 and that the local coordinates ^r]{^) dfs given by Definition 4-9. 
Additionally, assume that H satisfies the condition 

(Vi/(Vr,(r)),VF(Vr,(r))) >c^min{l,r,r^-r}|V^(V^^(r))| (H5) 

for some constant < Cj^. 

Then, the following estimates hold uniformly in ij: 

3Xi>0yse[0,Tr,):H{i;ri{r))>H{^^,{s)) + \i{r-s)\ VrG[s,r^] (hi) 
3A2 > Vs G (0, T^] : Hii^nir)) < H{^^{sr)) - \2{r - sf, Vr G [0, s]. (h2) 

Proof of Lemma 4-13. For r < it follows from Definition 4.9 and (H5) that 

^^H{Ur)) = (yniUr)), ^W^j^^ ^ c^min{l,r,r, - r] . 

Hence on (0,T^), the radial Hamiltonian /ijj(r) is strictly increasing. Additionally, we 
have with (4.13) that Trj < Cj^ < oo holds uniformly in rj. Therefore, it is easy to 
deduce (hi) and (h2) with constants Ai and A2 uniformly in r]. □ 

Now, we have provided all the ingredients needed for the proof of Proposition 4.2 in 
Section 4.3. For convenience, we make the following definition. 

Definition 4.14 (Radial- mixing measures). Assume that the perturbed Hamiltonian H 
satisfies (HI), (H2) and (H3) of Assumption 4.8 and that the local coordinates ipr^{r) 
are given by Definition 4.9. Additionally, assume that H satisfies the conditions (H4) 
and (H5). Then, the conditional measures 

fj,{dr\r]) = j^(r)exp ^-^F('i/'r,(r-))^ dr 

are called radial-mixing measures. 

The existence of radial-mixing measures is shown in Proposition 4.1, which is verified in 
the Appendix A. The term mixing in the definition of radial-mixing measures is justified 
by Proposition 4.2, which proofs that the Poincare constant g^ij) of iJ,{dr\r]) satisfies 

4.3. The Muckenhoupt functional: proof of Proposition 4.2 

The aim of this section is the proof of Proposition 4.2. In the argument we apply the 
Muckenhoupt functional [Muc72]. For the convenience of the reader, we restate the main 
result of [Muc72] in Theorem 4.15 below and clarify the connection of the Muckenhoupt 
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functional to the Poincare inequality in Proposition 4.16. For the sake of completeness 
we also state the proofs of Theorem 4.15 and Proposition 4.16 in full detail. 

The following theorem is a weighted Hardy inequality stated by [Muc72] in a more general 
context with a sharp characterization of the optimal constant up to a factor 4. 

Theorem 4.15 ([Muc72, Theorem 4]). Let ji he an absolutely continuous measure on 
IR,+ with values in U {+oo}, then there exists a constant C < oo for which 

/•OO / 1'^ \ 2 /"OO 

V/:E+^E U^f{t)dt]tiidx)<Cj^ /V(d^), (4.16) 
if and only if 

poo pr 

5 = sup / dfi — - dx < oo. (4.17) 



r>0 



r 



Kx) 



Furthermore, the optimal C obeys the estimate 

B<C<4:B. (4.18) 

Proof of Theorem 4- i 5. The proof consists of two steps. In the first step we show 
that (4.17) implies (4.16) with constant C < AB. In the second step, we will show 
that (4.16) implies (4.17) by the the estimate B < C, which then establishes (4.18). 

Define the function G{x) = j^dy. The first step is to use Cauchy-Schwarz inequality 
for the inner integral 

Now, note that with 2 4: \/ G(t) = , — j- — the last integral evaluates to 

1 

/"^ 1 / \ ~ 2 

i»*=^^^^^(X V ' 

where the last estimate follows from the definition of the constant B. Combining (4.20) 
and (4.19), integrating the resulting inequality w.r.t. to /i and using Fubini's theorem to 
interchange the integration, we arrive at 



^J^ f{t)dtj i,{dx)<2^ l/(Ol'\/G(t) I y (^y d^j "Mdx)jMdt). 

(4.21) 



Now, we observe that 
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Therewith, the inner integrals in (4.21) evaluate to 



d/i ) ' /x(dx) = -2 / ^{ \ dfiY dx = 2 ( [ d^V < 2 ^ 



(4.22) 

by the definition of B. Combining (4.22) and (4.21) results in the desired upper bound. 

For the lower bound assume that / is non-negative. We can bound the left hand side 
in (4.16) for fixed r > from below by 



f{t)dtj < ^ f{t)dt\ fiidx) <Cj^ 
Now, we want to show that for any r holds 



|/(x)|V(dx). (4.23) 



oo /-r 



dM / dx < C, (4.24) 







which implies B < C. In the case -jj^ dx = the estimate (4.24) is immediate. 
In the case Jq j^j^ dx = oo exists a function /(x) with JJ" |/(x)|^ d^(x) < cxd and 
/(x)dx = oo. Then (4.23) implies that dfi = 0, which also yields (4.24). In the 
last case, namely < Jq -j^^ dx, set /(x) = and observe that (4.23) becomes 



d/^ / —r^ dx <C —— dx, 

which gives, after dividing by ^^^dx, the desired estimate (4.24). □ 

From the previous theorem it is easy to obtain the following proposition, which determines 
the Poncare inequality in one dimension up to a factor of 8 for a general measure. The 
argument was found by [BG99], which considered the logarithmic Sobolev inequality in 
the continuous case and also by [Mic99] , which considered the Poincare inequality in the 
discrete case. 

Proposition 4.16 (From Muckenhoupt functional to Poincare inequality). Let fi be a 

probability measure on R absolute continuous w.r.t. to the Lebesgue measure and values 
in I[l+ U {+00} . Then the constants B^ and B^ defined by 



X 



Bm = sup ( / -^dy I d/i 



_ Jm i^yy) Jx 

are finite for fi-a.e. m if and only if they are finite for one common m. 
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Further, ^ satisfies a Poincare inequality PI{q) if and only if and are finite. In 
this case g obeys the estimate 

max{(l - F^{m))B~, F^{m)B:^} < g'^ < 4max{5^,5+} , 

where F^(m) = ^((— oo,m]). 

Remark 4.17. A common cited version (see for instance [Fou05]) of the above proposition 
is given by setting m = m* with m* equal to the median of i.e. ;u((— oo, m*]) = In 
this case, the estimate of Proposition 4.16 has the form of 

^max{S~.,S„*} < g^'^ < 4 max { S^;. , B^. } . 

This gives a general characterization of the spectral gap or Poincare constant up to a 
factor of 8. However, the median is hard to compute in general and the result stated in 
Proposition 4.16 is easier to obtain upper bounds. 

Proof of Proposition 4- 16- In the first step we will show that if m* exists such that -B~, 
and B^t are finite, then i?~ and B^ are finite for fi-a.e. m. Therefore, we write (x) = 
il" ~ij^^y /-OO such that with this notation i?~ = sup^<^-B~(x) and similarly for 
B^{x). By symmetry it is enough to consider m > m* . Then, an immediate consequence 
is B^ < B^f To obtain a bound for with x < m, we split up the integral in the 

definition of and obtain 

I'm ^ I'X I'm* t'X I'm j'^ 



— OO 



< t(-oo,m*\{x)B;;^,{x)+BX,im),r^4^ < B:„, + Bt ^^^""^ 



l-F^(m) - ™'1-F^(m)' 
which is finite for m with < F^{m) < 1, hence for /i-a.e. m. 

Now, we want to deduce the upper bound for the inverse spectral gap, which follows by 
noting that for an arbitrary m G H holds 

/OO pm poo 

if - f{m)f d^ = / (/ - f{m)f d/.- + / (/ - f{m)f d^+ 
-OO J —OO Jm 

(4.25) 

where we introduced the measures 

^„(dx) = l(„oo,m)(a;)/i(dx) and ^+ = l(^_+oo)(a;)/^(dx). 

We can apply Theorem 4.15 to both of the last integrals in (4.25) with the measure /i 
substituted by /i^ and obtain the estimate (4.16) with a constant C^, which is bounded 
by 4i?^. For /i^ this leads to 



{f-f{m)fA^Jii 



roc / py \ 2 poo 

/ / f'{x)dx) d^+(y)<4S+ / |/f d/.^ 

J m \J m / Jm 
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holds 



and similarly for fi^. This proves out right after again combining both integrals 

/m poo poo 

|/f d/.-+4i?+ / |/f d/.+ <4max{S„,S+} / |/f d^ 
-oo Jm J —oo 

the upper bound. 

Once again we start with equation (4.25) and consider the integral over (m, oo). Now we 
choose < r < and a function Qt such that for the optimal constant in 

poo poo 

/ (/-/(m))2d^<C+ / |/fd^ 
/ {gr-gr{m)fdf,>{C+-T) |g;|'d/i. (4.26) 

Jm J m 

These gr exists thanks to the optimality. By Theorem 4.15 we know that < C^. We 
can assume w.l.o.g. grinT-) = 0, else we could consider gr{x) = grix) — grim). Now, we 
set gr{x) = for X < m and note /u({(7t- = 0}) > fi{{—oo,m])) = F^{m). Therewith, it 
follows ^ 

gAl^ < /x(K # 0}) j g^^dfi < (1 - F^{m)) j g^.dn. 
This results in the estimate 

var^(s-^) = j g'^dfi - (^j grdjj^ > F^{m) j g'^d^ 

(4-26) , /■ , ,9 , /■ , ,9 

> F^(m)(C+-r) J \g',\'df,>F^{m){B+-T) J \g',f dfi. 



Hence, g ^ > F^{'m){B^ — r) and sending r — )• the desired result follows. The case for 
the integral on (— oo,m) follows similarly by symmetry. □ 

Now, we have completed all the preparatory work and can directly proceed to the proof 
Proposition 4.2, which is restated at this point. 

Proposition 4.2. Let li{-\rj) he a radial-mixing measure in the sense of Definition 4-M- 
Then the measure /x(-|t/) satisfies PI(^)(r/)) with constant 

^ 0(e) for 



q{v) 

uniformly in r] £ S'^~^. 

Proof of Proposition We deduce the desired statement by an application of Propo- 
sition 4.16. Therefore, we have to show that B^ = 0(e) for a particular choice of m. 
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For this purpose, let us set m = ^yfj^- Additionally, we assume that e is small enough 
such that m < I with | given by the conditions (HI) and (H2). 

The measure /i(-|r/) is absolutely continuous and has by denoting h^ir^ '■= H{iljrf{r) the 
form 

/i(dr) = -7rj-^j-q{r)e ^ dr, where fi{r]) = J jrj{r)e ^ dr and /i,,(0) = /i^(0) = 0. 

Furthermore, by (H2) it holds h'^{0) = {r],V^ H{0)r]) > X^i^ > by the non-degenerate 

Assumption 1.3. We start with the estimation of B^{r) for r < m = 0(e2). Then, 
according to (jl) of Lemma 4.12 it holds = r"~^. Now, we use the expansion 

/i^(r) = h'l^i^Y^ + 0(r3) = /i;'(0)^ + 0(ei) to find the estimate 

= j exp ( + 0(Vi) j dt j exp ( + 0{^e) \ dt 

We continue with the variable substitution r = sm and t = zra and arrive at 
5- (r) < m2 j'^ exp dz 2-"+^ exp (^^^ dz (l + 0{^/^)) 

The constant En is given as upper bound in the estimate 

<1 <^ I. (?i-2)s" ^ ' 

Now, we bound the right hand side of the above estimate. In the case n = 2, the function 
s I— )• s^ln - attains its maximum for s = A= with value 77-. Hence, we have Eo = t^. 

In the case n > 2, the function s 1— t- s^(1 — attains its maximum for s = {^)"~^ 

with value (^) "^^ (l — ^) < Hence, we can set En = We note that in any case 

holds En < 1 independently of n. 

Now, we continue with the estimation of B^{r) for r > m. Therefore, we can use the 
properties (hi) and (h2) of Proposition 4.13 as well as the fact the j,j(-) is non-zero in 
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{0,Tn) by Lemma 4.12 



where we have used the Laplace method in the last step. □ 



4.4. A polar mean-difference estimate: proof of Proposition 4.3 

This section is devoted to the proof of Proposition 4.3. We have to show the estimate 

// (/(r/)-/>))'/i(dr/)/i(d^)<0(e) j jV/l^d^. 

The procedure is similar to the proof of the mean-difference estimate of Theorem 1.9. 
Analogous to section 3.2, we show in the first step that it is sufficient to consider the 
mean-difference estimate w.r.t. the simpler measures v{-\ri) that are Gaussian approxi- 
mations of the measures ^{■\'r]). This step is the content of Lemma 4.18. Following the 
ideas of Section 3.1, we estimate in the second step the mean-difference w.r.t. the approx- 
imations i'{-\rf) by using a transport argument. This step is the content of Lemma 4.19. 
Compared to Section 3.1, we have to argue more carefully since the support of the mea- 
sures i'{-\ti) is only one-dimensional. Therefore, we need an additionally ingredient in 
the proof of Lemma 4.19: It is an identity for the spherical mean and is provided by 
Lemma 4.23. 

Provided the ingredients are valid, the proof of Proposition 4.3 just consists of a straight- 
forward application of Lemma 4.18 and Lemma 4.19. 

Now, we turn to the first step. Let us introduce the approximation for the measures 
li{dr\rj) and fi{dr]). Recall that in Section 3.2 we introduced the truncated Gaussian 
measures z/j on any domain of attraction (cf. (3.6)). In this section we had the 
convention to omit the index i and considered to be a domain of attraction. We also 
assumed w.l.o.g. that the unique local minima of H on 0, is located at G fl. Using 
these conventions we may write the approximation of on as 

u[dx) = — l£;(x)e 2e dx, with Zy = j e 2e dx, 
■^y JE 

where the set E is given by 

= {x G R" : \^/V^H{0)x\ < V2e uj{e)}. 
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In the new coordinates given by il^nii^), the measure ^'(dx) becomes (cf. (4.1)) 

1 _i V^-ff(0)[»7]r^ 

z^(dr,dry) = — lE{ipn{r)) e 2e dr drj. 

In the last identity we ah'eady apphed the fact that for e small enough the Jacobian 
determinant is given by j,j(r) = r"~^ (cf. (jl) of Lemma 4.12). Following (4.2), (4.3) 
and (4.4), we decompose the measure i/(dr, dr/) according to 

z^(dr, dr/) = zv(dr|r/) uldr]), 

where the conditional measures z^(dr|7/) and the marginal j>(dry) are given by 

uidrlr]) = lE(i/j„(r)) r" e 2e dr and 

1 /" _i _v^HXo)Wri 
i>(d?7) = 7^ / ^" e 2c dr dr/. 



Now, we are able to formulate the first ingredient of the proof of Proposition 4.3. 
Lemma 4.18 (Approximation by truncated Gaussians.). Using the notation 

fiv) ■= J /(V'r,(r'))/i(dr|r/) and f{r]) := J /(V',,(r))z^(dr|7/), 

it holds for any function f that 

II (/(r?)-/>))'A(dry)A(d0)<3 H (/(r?) - /i(d,?)A(d0) + 0(e) ^ |V/pd/i. 
Proof of Lemma 4-18. Using the triangle inequality we can estimate 

(/(r?) - />))' < 3 ((/(t?) - /(r?)) V (/(r/) - f{e))\ [fie) - my 

By applying the same strategy as in Lemma 3.6, we can estimate the first and third term 
of the last inequality by 

var fMM^ 

where g^j is the Poincare constant of fJ-{-\r]). By Proposition 4.2, we know that 

Q,' = 0{e). 

Additionally, one can show as in Lemma 3.8 that 
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Finally, we only have to express drf in terms of the full gradient V/. Because of the 
parametrization of the coordinates ipn{f) by arc- length, we have |';/'»?(^)| = 1- This yields 
the identity 

\drf{Ur))\ < |V/(^,(r))| |^,(r)| = |V/(V^,(r))| , 
which completes the argument. □ 

Now, let us turn to the second ingredient of the proof of Proposition 4.3, namely: 
Lemma 4.19. Using the notation 

m ■■= I f{Mr))Hdr\v), 
it holds for any function f that 

I [ {hv) - hO)fmriW{0) < 0{e) j jV/l^d^. (4.27) 

The last statement is verified by using a polar transport interpolation argument. Before 
turning to the proof of Lemma 4.19, some preparatory work has to be done. 

For the transport argument showing (4.27), we embed v{-\r]) and li{-\ri) along rjr as one- 
dimensional measures in K," and note that i'{-\'r^) <C ^J'{•\'^]) for all rj € 5"""^. Then, we can 
still define for r],6 £ S'^~^ a transport distance like 7^ by considering the Radon-Nikodym 
derivative in the cost density w.r.t. IJ-{-\ri). 

We denote by <(r/, 9) C S"^^ the geodesic on S^~^ connecting r] and 9 parametrized on 
[0, \<{r], 9)\], where |<(t/, 9)\ is the arc-length of the geodesic bounded for any r],9 £ S^^^. 
Note that <{r],9) is unique for almost all rj,9 £ S^~^. Let (f?s)se[o,|<(r?.e)|] be its constant 
speed parametrization. Then consider the transport interpolation defined on supp i/(-|7/o) 

by 

^simr) ■■=(r!a^'^7]sr where = V^H{0)[r]s]. (4.28) 

The transport interpolation is chosen such that 

($,)p(-|7?0) =Z^(-|f?s). 

In the proof of Lemma 4.19 we need an estimate on the transport speed, namely: 
Lemma 4.20. With the definitions from above it holds 



|<i>.oci);i(rr/.)|<ryi+(^) , (4.29) 
where Xmin md Amax is the largest and smallest eigenvalue ofV^H(0), respectively. 
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Proof of Lemma 4-20. First, for the derivative and the inverse of <I>s, we have that 



Hence, the composition is given by 



1 (j.o 



2 ^(TsCTo 



and $5 ^(risr) = r]oas '^CF^r. 



2 a, 



By using r]s -L r]s and IVsl = |^s| = 1 we obtain the estimate 



/l ■ |2 , 1 I |2 



r\ 1 + 



The result follows from the definition of a, (4.28) and the estimate 



0"s 



□ 



The next lemma contains the main contribution to the proof of Lemma 4.19. It is the 
estimation of the mean-difference (/(?/) — f{0)) by applying a transport argument similar 
to the argument outlined in Section 3.1. 

Lemma 4.21. For any r],6 £ S''^~^ it holds 

(fiv) - /»)' < 0(e) / / |V/(rO| M(dr|0 ^'(d^), 
^ ^ J<iv,d) J 

uniformly in r],9. We want to recall that <{r],6) denotes the geodesic on S^~^ connecting 
r] and 9. 



Proof of Lemma 4.21. We estimate the mean-difference in the same spirit as in (3.1), 
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namely 

where kh = 1 + ^ ^ from (4.29). From this point, we estimate more along the lines 

of Remark 3.2 by using Cauchy-Schwarz w.r.t. L'^(fj.{dr\r]s)) and Jensen's inequality for 
the outer integral 

/- ~ \2 fl<(»?:^')l r ^ i/(dr\r) ) C 

[fiv) - fiO)) < \<{V,0)\ J \yf{rVst -J^l^) /^(d^l^^) J ^ ^(d^l^-) 

H<(r),0)| r r 

~ K^vr / / \Vf{rr]s)\ n{dr\rjs) / r i^(dr|r/s) ds 



Jo J J 
by using again the error estimate on the approximation given in Lemma 3.8 and therefore 
/j(dr|y ) ~ 1 + 0(v^|l'^^ (^)l)- Finally, note that the second moment of J^(r|r/s) is 0{e) 
uniformly in 7]^, due to the non-degeneracy assumption on the Hessian V^/7(0) in the 
minimum (cf. Assumption 1.2). □ 

Before turning to the proof of Lemma 4.19 we need two more ingredients. The next 
lemma contains an asymptotic characterization of the relative density of the marginal 
measure jl w.r.t. the uniform probability measure on 5"~^ denoted by 

Lemma 4.22. The relative density of jl w.r.t. the uniform probability measure on S"'~^ 
denoted by ^ can be estimated for any ^ S S"'~^ as 



df^ _ ^detV^HjO) 
d^^^' ^ (V2//(0)[e])f 

In particular, the last identity yields for all ^ € S''^~^ the uniform bound 



2 



X / - H ^ ) ■ (4.30) 

^min / Cls V ^ 

Proof of Lemma By the assumption (H2) the coordinates are spherically around 

the local minimum. Hence, expanding 77 in in the direction ^ yields 

H{ri) = H{0) + v'HiOm'^ + 0{r^). 
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1 






Jo 


1 






Jo 


1 


1 / 2e 




2 VV'^(0)K] 



Now, we choose such that H{0) = 0. Then it holds that 
fi{^) = — I e i5(r) dr 

dr (l + 0(V^w3^e)) + — / e — dr 

|)'rg) (l + 0(Via;3(e))+0(exp-^(^)). 

By noting that w (27re)t (det V^/f(0))^^^ we arrive at 

^ v/detV2g(0) r(§) ^ VdetV2g(0) 
^ (V2F(0)[e])t 2^t ^ (V2F(0)[e])t 

where we used the fact that the density <j(^) is given by the constant 



27r2 



□ 



The last remaining ingredient for the proof of Lemma 4.19 is the following basic state- 
ment. 

Lemma 4.23 (Spherical mean). Let q be the normalized uniform measure on S^~^. Then 
It holds for all F : S""! ^ L^{S''-\c;) that 

Iff F{0 n\dO ?(dr/) ,{de) = ^ / F{9) ,{de). 

JS"-^ JS"'^ J<(e,ri) ^ JS^-^ 

Proof. Choose e G S"""^ fixed. Then by the transitivity of SO" on exists for all 

6 G S*""^ a rotation qq G SO" such that qqO = e. Then, the result follows from the 
variable substitutions first ^ i— ?• gg^ and second r] i— )• 5(97/ 



F(0 l^^dC) ?(d7?) ?(d0) = J J J F{geC)n\dC),idr^),ide) 

5n-l 5n-l <t(e_^) 5n-l 5n-l <t(e^g-l^) 



S"-! 5"-i <(e,»?) 

By interchanging the order of integration from 7^^(d^) ?(dr/) ?(d^?) to <?(d0) ?^^(d^) ?(dr/) 
we find with the transitivity of SO" on S"~^ that 

/ F{gg^)(;{de)= I F{e)q{d9), independently of 
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The last step is to observe, that the remaining double integral is the mean length of a 
geodesic on S^~^ 



n 

<{e,ri) 



^(de)?(dr?)= / |<(e,7?)K(d7?) = ^. 



□ 



Finally, we can turn to the proof of Lemma 4.19 



Proof of Lemma 4- i 9. Integrating the inequality of Lemma 4.21 w.r.t. and fi{9) 

results in the estimate 

/(^)-/»)'A(dr/)/i(d^)<0(e) //// |V/K)|V(dr|0 ^^d^) A(dry)/i(d0). 

<{v,e) 

Hence, we arrive by setting F{^) = J |V/(r^)|'^ /i(dr|,^) at 

/(ry) - A(dry)A(de) < 0(e) // / F^O m\dO ?(dr?)?(de). (4.31) 

^ J J J<{ri,e) 

Applying Lemma 4.23 to (4.31) and changing the measure back with the help of (4.30) 
results into 





/(^)-/>)) A(d»7)A(d^) < 0(e) j J |V/(V,(r))|2 M(dr|ry) A(dr/) 

= 0{e) J iV/l^d/., 

which yields the desired statement (4.27). □ 



A. Construction of the perturbed Hamiltonian H: proof of 
Proposition 4.1 

Recalling the notation of Section 4, we only consider one basin of attraction fij (cf. (1.6)) 
and therefore omit the index i. We assume w.l.o.g. that € is the local minimum of 
H in i}. This part of the appendix is devoted to the proof of Proposition 4.1. 

Proposition 4.1. Assume that the basin of attraction Q is bounded. Then, there exist 
coordinates {V'rjj^g^Ti-i on Q such that the conditional measures fi{dr\r]) are radial-mixing 
measures in the sense of Definition 4-M- 



Hence, we have to show that on a basin of attraction 0, (cf. (1.6)) exists a perturbation 
H of the Hamiltonian H satisfying the conditions (H1)-(H5): 



51 



A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 

(i) H equals H up to small (^-neighborhoods around the critical points of H, i.e. by 
recalling that S is the set of critical points of H we have 

yes 

(ii) H restricted to a ball around the local minima located in of radius | > is 
spherically symmetric and linear, i.e. 

Vx G ^5(0) : H{x) = H{0) + \x\ . (H2) 

2 

(iii) |Vi/| is uniformly bounded from below in r2\{0}, i.e. 

3cj^ > 0,yx en\{0} : \VH{x)\>Cjj. (H3) 

(iv) Gradient flow coordinates (V'r;('^))jjG5"-i obtained from H in the way described in 
Definition 4.9 satisfy for all r] £ S""^ 

VrG(V2,T^): -C^ max {l, (T^ - r)-^} < V • F(Vr,(r-)) < C^^, (H4) 

(v) and 

Vr G (0,rj : (V/7(V^^(r)),Vii'(Vr,(r))) > c^min{l,r,r^ - r} |VF(V,,(r))|. 

(H5) 

Because of (HI), the perturbation H is only allowed to differ from the Hamiltonian H on 
small neighborhoods around the critical points z £ of H. Therefore, we can consider 
every critical point separately and do an explicit local construction of the perturbation H 
on these neighborhoods of critical points. In Section A.l, we consider the case where 
the critical point is a local extrema of H. In Section A. 2, we consider the case that the 
critical point is a saddle point of H. 

A.l. Correction at local extrema 

In this section, we can omit the case where the critical point z is a local maxima. The 
reason is that the construction of H would be very similar to the construction of H 
around a local minima by considering —H instead of H. 

We denote by Amin the smallest eigenvalue of V^i^(O), which is strictly positive by the 
non-degeneracy Assumption 1.2. Hence, we can choose 5 > such that H is strictly 
convex on Bs(0). Let ^ G C°°(Il+, [0, 1]) be a decreasing smooth step function satisfying 
the conditions 

Vr G [0,1/2] : ^(r) = 1, Vr G (1/2, 1) : ^'(r) < 0, and Vr G [1, oo) : ^(r) = 0. (A.l) 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 

For convenience, we set £,s{r) = We define the correction H of H around the 

minimum as 

H{x):=C5{\x\) {\x\-8) + {l-Cs{\x\)) H{x), for x G 5^(0) . (A.2) 




Figure 3: Contours and fiow lines derived from H (left) and H (right). 



Now, we state the main result of this section. 

Lemma A.l. For 6 small enough, the function H satisfies the properties (H1)-(H5) in 
the neighborhood Bg(0) around the minimum G fi. 

Proof. The property (HI) is fulfilled by construction. 

The property (H2) is satisfied by the definition (A.2) of H and the definition (A.l) of ^, 
which ensures that the gradient-flow coordinates of H coincide with polar coordinates 
on the ball Bs/2{0). 

The property (H3) is verified by a straight forward argument, which we outline in full 
details for the convenience of the reader. Direct calculation leads to the identity 

\/H(x) = a(x) + b(x) VH(x), with (A. 3) 

a{x) = 6-'^'s{\x\) {\x\-6-H{x)) + Cs{\x\) and b{x) = 1 - ^s{\x\). 
For the verification of (H3), we will need two observations. 

The first one is a{x) > ^s{\x\), which follows for small enough 6 from our assumptions 
on H, namely H{0) = and H{x) > as well as the monotonicity (A.l) of ^, i.e. ^' < 0. 
The second observation is that for x G Bs(0) it holds 

(^,VHix)) = \x\ /^,V'H{0)^)+O{x')>\rnin\x\+O{5')>^\x\, (A.4) 

\l^l / \ fI fI / ^ 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 



which follows directly from Taylor expansion and choosing 5 sufficiently small. In the 
last inequality, Amin > denotes the smallest eigenvalue of \7^H{0). 
Let us now verify (H3). The identity (A. 3) directly yields for x S Bs{0) 

\VH{x)\ = a^{x) + 2a{x) b{x) V^-H'(x)^ + ^^(a;) \VH{x)\^ 

^>'^a2(x) + 2 a(x) b{x) ^\x\ + b\x) \VH{x)\'^. 
We consider two cases. In the first case, we assume that |x| < |. Then we get 

|V^(x)| > a^ix) + 0{6^i^>\ (1/4) + 0{6^) >cg>0. 
In the second case, namely j < \x\ < 6, we get 



|V^(x)| > 6^(x) \VH{x)\'^ + 0{6^ 



(A.l) 



> (1 - e (i))' min |VF(x)|2 + 0(5^) > > 0, 

|<kl<'5 

which yields the desired property (H3). 

Now, let us turn to the property (H4). It suffices to show that for x G Bs{0)\Bs/2iO) 
holds 



V ^ x) 

\VH\ 



< C. 



However, this easily follows from the observation that 



V ^ X 

\VH\ 



1 



\VH{x) 
(H3) n-l . 



< 



AH{x) 



\V'H(x) 



-^{x),V^H{x) 



\VH\ 



\VH\ 



and the fact that H(x) is by construction a smooth function on Bs{0)\Bs/2iO)- 
Finally, let us verify the property (H5). It is sufficient to show that 

Vt such that G Bs{0) : (vH{iP^{t)),VH{'^riit))^ > Cfj t |VF(?/;^(t))|, 

where the local coordinates 'ipn{t) are given by (4.14). Using (A. 3) we get 

Vi?(x),VF(x)\ - "^-^ /V7C7/™^ - \ , U^\IV7U/^M2 



= a{x) {VH{x),—J+b{x)\VH{x)(' 

> a{x) (vH{x),^\ +b{x) \VH{x)\ (vH{x),^, 

\ |x| / \ |x| 

^^>\aix) + bix) |Vi/(x)|) ^|x| 

(A.3) \ . 

> ^ |x| \VH{x)\. 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 

By the last inequality, it is only left to show that for some constant c > 

iV'rjWI >ct, for all iJriit) G ^5(0). (A.5) 

It follows from the definition (4.14) of ipn{t) that 

Jo Jo \\i^vi^)\ \VH{^l)^{s))\ I 

For Tprjis) £ -65(0) it follows from (A. 3) that | V/7(^.^(s))| < C. Therefore, we can 
estimate 



> > 0, 



where the last line is deduced with the same argument as used for (H3). A combination 
of (A.6) and (A. 7) yields the desired estimate (A.5) and therefore concludes the proof. □ 



A. 2. Correction at the saddles 

In this section, we give a construction of a perturbation H satisfying Proposition 4.1 
nearby saddle points. Every A:-saddle p lies on the boundary of il.. We will locally deform 
H in such a way that either trajectories will leave Q nearby the saddle or they will avoid 
coming close to the saddle (cf. Figure 4). We achive this behavior by introducing a kink 
on the stable manifold (cf. Figure 5). 

We assume w.l.o.g. that the saddle point p lies at the origin and let Q be an orthogonal 
matrix, such that 



V^HiO) = Qdiag(Ar, . . . , A^, A+, . . . , X^_,)Q^ , 

where A^, . . . , A^ < and A^*", . . . , A^_^ > 0. Therefore, the 
stable £^ and unstable £^ subspaces of V^//(0) are given 
by 



:={g(y,0)^GE" :yeE^} 
£^ := {Q(0,y)T G E" : y G E"-'^} . 











hsiXs) 

I 








, _0 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 




Figure 4: Flow lines derived from H (left) and H (right). 



The stable and unstable manifold of the saddle at are given 
by 

:= {yo e R" : = -VH{yt), yt ^ 0} 
:= {yo G : = VH{yt),yt ^ 0} . 

Theorem A. 2 (Stable manifold theorem [Tesl2, Theorem 9.4, p. 259]). For a small 

neighborhood U of the saddle the locally stable manifold Wi^^ := Pi U are smooth 

manifolds tangent to £^ at the origin. There exists neighborhoods ^ £^ , ^ £^ and 
a smooth map hs '-lA^ ^ satisfying 

h^{0) = and Dhs{0) = (A. 8) 

such that the local stable manifold allows the representation 

Wloc = {xs + /is(xs) : Xs e W} . 

Therewith, local coordinates on the saddle are given by the mapping 

C(Xs,Xu) -.W XW CK'' i{Xs,X^)^ Xs + hs{Xs)+X^. 

With the projections Hg and 

Ils-.U^W Hs = Q diag(0, . . . , 0, l...,l)Q^ 

fc-times (n— fc)-times 

n^-.U^W = Q diag( )Q^ 

fc-times (n— fc)-times 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 

we obtain the reverse coordinate transformation by 

U3x^ {UsX, U^x - hs{U^x)) £W X W. 

The construction of an additive perturbation is done with the help of two functions 
p, g G C^(]R+, [0, 1]) with support in [0, 1] such that 

p(0) = l, p'(0)=0, VrG(0,l):p'(r)<0 (Ap) 
\v'{r)\ < 8r for r G [0, 1] (Bp) 
p{r) > (1 - 2r)+ for r G [0, 1] (Cp) 

< r < . < 1 : < (Dp) 

q{0) = l, Vr G (0, 1) : (?'(r) < (Ag) 
(1 - 2r)+ < \q{r)\ < 2 for r G [0, 1] (Bg) 



Example A. 3. The following both functions satisfy (Ap)-(Dp) and (Ag)-(Bg) 

p{r) = {1 - r^)%o,i){r) and g(r) = (1 - r)(l - r^)^!;^ ,)(r). 

Lemma A. 4. For constants a,S > define the additive perturbation 

: ]R+ X ]R+ ^ R (s, n) = a5^p Q q (^) , (A.9) 

where p and q satisfy (Ap)-(Dp) and (Aq)-(Bq), respectively. Let us define the modified 
Hamiltonian H onU := {x G R" : \Ilsx\ < S, \IluX — hs{Ilsx)\ < 6} by 

H{x) := H{x) + H+^{\Ilsx\ , |n„a; - h,{Il,x)\). (A.IO) 

Then, there exists a constant a > and 5 small enough, such that H onU satisfies the 
properties (H1)-(H5) of Assumption 4-8 and Theorem 4-i- 

Proof. The properties (HI) and (H2) are fulfilled by the construction of in (A.9). 

Argument for (H3): Let x = Xs + x-^ £ U\Wf^^ with Xs = HsX and Xu = Hu, hence 
x-a 7^ hs{xs)- Further, set s = \xs\ and u = |xu — h{xs)\- Then, it follows from (A.IO) 

V^(:e) = VH{x)+dsH,l^{s,u)^ + duH+^{s,u){ld-DhJ{x,))^^^^^^^. 

s u 

Firstly, we expand VH{x) around the saddle point and find 

VH{x) = VH{xs + Xu) = V^H{0)xs + V^H{0){xu - hs{xs)) + R2{x), (A.ll) 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 




Figure 5: H (left) with a kink on the stable manifold and (right). 

where the remainder term R2{x) satisfies |i?2(a;)| < {Ch + C^) \x\^ < {Ch + ChjS"^ 
as |/is(a^s)| < Ch\xs\^ by (A. 8). We note that the partial derivatives of the additive 
perturbation are given by 

^s^itc(S'^) = O'^^^' (I) 9 (^) duH{s,u) = a5p{^ q (^^ 

Combining all expressions for the gradient of the perturbed Hamiltonian results in 



VH{x) = {V^H{0)S + a5p' (^) q Q Id) ^ + a6p q' Q DhJ {x^)^ 
=: Ns{x) + N^{x) + R2{x). 



u 



+ {v''H{0)u + a5p (0 (^) Id) ^^^''^^ + R2{x) 



(A.12) 



We want to stress out that {Ns{x), Nu{x)) = 0, because of \7^H{0) leaves and £^ 
invariant as well as DhJ {xs){x^ — hs{xs)) G and, of course, also {xs,x^ — hs{xs)) = 0. 
This leads us to the conclusion 

|VF(x)|2 = \Nsix)\^ + \N4x)\^ + 0{6^) > max{\Nsix)\\\N^{x)\^^ + 0(6'^). (A.13) 

We proceed with the calculation of lower bounds for |A'^s(x)| and |A'u(x)|. We start with 
|A'^s(x)| and denote with A^j^^ of the smallest positive eigenvalue of the Hessian \7^H{0) 

mx)\ > A+ - a6 \p' (^) \qQ- a6p q' Q |mj(xs)|. (A.14) 
We employ now the assumptions (Bp) and (Ag) to bound the second term in (A.14) by 

Likewise, we use the assumptions (Ap) and (Bg) and the additional observation that 
\DhJ{xs)\ < Ch \xs\ by (A. 8) to bound the third term in (A.14) by 

aSp (^) q Q) \DhJ{xs)\ < 2a6ChS < 2as, for 6 < C^^ . (A.16) 



< 8as. (A.15) 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 



Hence, we can deduce a lower bound for |A''s(a;)| by choosing a small enough. Indeed, we 
observe 

\Ns{x)\> {X+^^-8a-2a) s>6 ^ - for « < (^-1^) 

For the estimation of |A'u(a;)|, we use the fact that by the definition of the Hessian 
V^H{0) is strictly negative definite on U^. Therefore, let us denote by A"-^^ the smallest 
negative eigenvalue of V^-ff(O) in modulus. In addition, we use the assumptions (Cp) 
for p and (Bg) for q to deduce the lower bound for |A'u(x)| 



s 
6 



(Cp),(Bq) / S\ / U 

> \X-.Ju + aS (1 - 2-j (1-2- 



(A.18) 



where the last step follows from either setting u = or n = |. We combine the esti- 
mates (A. 17) and (A. 18) with (A. 13) and arrive at 



A™:ti s . f |A ■ I / 2s 



\VH{x)\ > 5max <j ^-,min I ( 1 - - ) } } + 0{6 



5min^max^^^,^|,max|^^,a('l-|) \\+0i5 



> o l^minl> ™ A+ a 1 - ^ }}+Oi5-2) 



2s 



,3 , 



5 

2 I Kin + 2a 



A+. 

It holds for a < -ffl^ that , ° > ° . Therewith, we can assume w.l.o.g. that 



a < min ^jA^j^jj^l A^j^^j and have the estimate 



min 

Hence, in the last step we can choose some 6 small enough such that the error term 
0(52) can be absorbed 



\VH{x)\>^-^ for a<min|^,2A-,„A+„ 



Argument for (H4): Let us recall of what condition (H4) consists of. We have to deduce 
a lower and upper bound for 

V ^ = I AH - ( :^,V^H- 



\VH\ \VH\ \ \ \VH\ \VH\ 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 

We observe that if Xi{x) < . . . A„(x) denote the ordered eigenvalues of V^i?(x), then the 
following bound holds 

{n - l)Ai(x) < Ux) <AH-i V^H \ < ^ A,(x) < (n - l)A„(x). 

1=1 \ I I I I / 1=2 

For the estimation of Ai(x) and An(x), we use the variational formulation of the eigen- 
values 



Ai(x)= inf {r],V^H{x)i]) and A„(x) = sup {r},V^H{x)r]) . (A.20) 

The first step is to extend H till second order. Therefore, we start with 'SJH and observe 
by comparing with (A. 12) 

V^(x) = VH{x)+dsH+^{s,u)'^ - duH+^{s,u)DhJix,)^^^^^^^ 

_\ ^ (A.21) 

+ duH+^{s,u) , 

The terms on the first line on the right hand in (A.21) are smooth C^-functions. There- 
fore, we can write 

VH{x) = ^^H^^^{s,u)^^^^^^^ + R^{x), (A.22) 

u 

where Ri{x) £ C^{W^) and ||-Ri ||ci(if{^n) < C\x\ for some constant C depending on H 
and hs with their first and second derivatives as well as on p and q. Hence, we note that 
\\7H{x)\ = 0(1). Now, we calculate the Hessian of H by only considering the leading 
term of (A.22) 

V'H{x) = DR,{x) + dlMs.nf-^^^^^ ® ^ 

+ dlK^u)^^^^^^ (n.-ml(x3))(..-/.s(x3)) ^^ 23) 

u u 

u \ ^ ^ u u 

The terms in the first two lines of (A. 23) are all at least 0(1) by the assumptions (Ap), 
(Bp), (Aq) and (Bg) as well as by the properties of hs given in its Definition (A. 8). The 

prefactor '^'^^ioc(^'^^ Qf f^j^g i^st term in (A. 23) blows up like since duH^^{s,u) < 
by assumption (Ag). The matrix in the last line is of the form A — v Av , where 
A = ld-DhJ{xs) and v = G S*""^. We can assume that A > 0, because of 

\DhJ{xs)\ = 0{s) by (A. 8). For a matrix A > holds A - v Av > with equality 
only if V is an eigenvector of A. If we use these observations together with (A.20), we 
arrive at 



-C 



u 



< Ai(x) < A„(x) < C, 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 



where C is constant depending on \DRi{x)\, H^^{s,u) with its first and second deriva- 
tives and hg with its first derivative. Now, we estimate with the definition of and 
with (Bq) 



-a6 p 



u 



> -2a6p 



In combination with (A. 19), we arrive at a pointwise version of (H4 



-{n-l)\C + 



In the last step of the proof of (H4), we have to combine the estimate A. 2 with the 
trajectories iprj{t) in U. Therefore, we introduce the functions s{t) and u{t) given by 

s{t) := \Us^prj{t)\ and u{t) := |n„V»?W - hs (HsV'ryW)! ■ 

Further we denote with Psi') = pi-g)- To finish the proof of (H4), we have to show that 
for a trajectory ipriit) in U holds 

P5{s{t)) „ / 1 



Now, to show (A. 24), we deduce first two differential inequalities for u{t) and s{t) which 
holds for 5 sufficiently small and a satisfying (A. 17) 

l>m> ^8 + 0(5^), (A.25) 

1 > -u{t) > a{S - 2u{t))+ps{s{t)) + 0{5). (A.26) 

The deduction of (A.26) and (A.25) is similar to the estimate of A'^s and N^. By direct 
calculation and term-wise estimation of |V//| in (A. 21), we find that \VH\ = 0{5) in U. 
Hence we can assume from now that \VH\ < 1. The starting point for all estimates is 
the representation of 'VH given in (A. 11) 



1 



We follow along the lines of (A. 14) and (A. 17) to estimate (■0^(t), A'^g(^/>^(t))) and find 



sit) 



"^^^(^V^i.(0)^,(t)\+c^.i.,tc(Kt),n(t))|n^^^(^)l' 



s{t) 
s{t) ■ 



(A.14) 

> A+ ,s(i) - a6 



p 



/ fsit) 



u{t) 



a5p 



u{t) 
s{t)\ ,[u{t) 



Dhl{Ii,i;r,{t)) 



(A.15),(A.16) 

> ^iinKO - 8as(t) - 2aChs{t). 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 



Using the same choice of a as in (A. 17) shows (A. 25). For the differential inequality (A. 26) 
we have to take care of the sign of u{t) 



-n(t) 



> 



u{t) 
uit) 



|V#(V,(t))| 



+ \Dh,(U,i;^m+0{5'). 



Since \Dhs(iisi^n{t))\ = 0{\llsipri{t)\) = 0{5) by (A. 8), we only have to consider the first 
term. We can use a similar estimate to (A. 18) and find 



u{t) 



\VHi^p,m 



6 



(B,) 



> |A-ij7x(t) + a5 1-2 



At) 



6 
P 



sit) 



which concludes the proof of (A. 26). 



Let us introduce the entrance time E^j of '0r?(i) hito suppi/j^^ where we can assume 
that supp-ffjQ^ C B2S- From (A. 25) follows that the function s{t) is strictly increasing, 
whenever s{t) is large enough and 5 small enough. Let us assume that 5 is chosen small 
enough such that s{t) > Cs > whenever s{t) > |. Then, we can consider two cases: 
First, there is to G i^ri, T^) with s(to) ^ | and hence s{t) is increasing for t G (tQ,Tfj) and 
second there is no such to and s{t) G (0, |) for all t £ {E>ri,T^). Let us first consider the 
latter case. Further, as we are only interested in the blow-up behavior, we can assume 
w.l.o.g. u{t) < I and find 



a5 

-u(t) > —p 



+ 0(6^) >cu>0. 



(A.27) 



Hence, especially u{t) > Cu{T^ — t) from which we deduce with (Aj 



< 



1 



Ps{s{t)) 

U{t) ~ Cui^ - t) 



and therewith also (A. 24). Now, we assume that there exists to £ (-^rj) ^r?) with s(to) > 
We use u{t) > f /fX^)dt' for t G (to,T^) by (A.26). Then it holds 



Ps{s{t)) ^ 2 



Ps{s{t)) 



(A.25) 2 
< 



psisit)) (Dp) 



u{t) ~ a<^/^^>5(s(t'))dt' ~ ''^ 1%'^ Psis)ds ~ a6 {sm)-s{t))^' 
By using the lower bound s(t) > > 0, we can infer 



siTr,) - S{t) 



s(T)dT > CsiTr, - t), 
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A. Construction of the perturbed Hamiltonian H: proof of Proposition 4.1 
from which the estimate (A. 24) follows. 

Argument for (H5): We conclude in several steps. Firstly, we deduce a pointwise estimate 

(A.28) 

Then, in the second step, we get rid of the error terms, as they can be compensated by the 
first and second term of (A.28), respectively. If we for the moment assume that (A.28) 
holds, we still need to recombine this estimate (A.28) with the trajectories. Hereby, we 
use again (A. 25) and (A. 26). Like in deducing (H4), we have to consider two cases: 
First, there exists to G {En^T^^) with s(to) > \5 and secondly, there exists no such to- In 
the first case, there is nothing to show, since we can assume that s{t) is monotone and 
therefore with (A.28) holds 

yt e {to,Tr,) : {VH{ij,{t)),VH{^,m > i^llll!,(to)2 > i^^S^. 

Now, we can assume, that s{t) < for all t G {Ejj,T^) or for all t G {Efj,to). Therefore 
it holds (A. 27) and we can deduce the estimate for u < | 

{VH{i;^{t)),VH{i;^m > \X-,Ja^-p u{t) > \X-,jJ-p c„(r^ -t). 

The conclusion (H5) follows from the observation that |V/f| = 0{5) in U and therefore 
especially |V//| < 1 for 6 chosen appropriate small enough. 

To finish the proof, we still have to show (A.28). We start with (A. 11) and (A. 12) to 
deduce 

(vH{x),VH{x)^ = {V^H{0){x^ + X, - hsix^)) + R2ix),Nsix) + iV,(x) + R2ix)) 

= {V^H{0){x^ + x^- /is(xs)), N^{x) + N^{x)) 
+ {R2{x),Ns{x) + Nu{x) + V'^H{0){xs + x^ - hs{xs)) + i?2(x)> . 

Let us first estimate the order of the error terms containing R2{x). With (A. 11) we find 
that \R2{x)\ = 0(|x|^) = 0(s2) + ©(u^). Further, we have \Nsix)\ = 0{6) as well as 
|A'tt(x)| = 0{6) and of course |rE| = 0{6). Therefore, the total approximation error can 
be estimated by 

{R2ix),Nsix) + Nu{x) + V'^H{0){xs + - Kixs)) = OiSs^) + 0{u'^5). 
We find an estimate for (V^i?(0)(xs + x^ — hs{xs)), Ns{x) + A''u(x))by using the repre- 
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B. Gaussian integrals and linear algebra 



sentation of VH in (A. 12) and going along the lines of (A. 14) and (A. 18) 

= {V^H{0)x„Ns) + {V^H{0) (x, - hsix,)) , iVu(x)) 
> (V^H{0)x„V^H{0)xs) 



a5 



a6p 



u 



V^H{0)xs 
\V'^H{0)xs\ 



+ aSp Q \q' (^) I /-V'H{0) (x, - hs{x,)) , 



u 



2„,2 



2u 



\X-„ u. 



We finish by estimating the first three terms. Therefore, we use, like in (A. 17), the 
assumption that 5 < and obtain 

{X^^fs" - SaX-^^y - 2aXl,JCHs' > ((A+ _ lOaA^,,) > i^l!,^ 



where we set a < J?+ in the last estimate, which leads to (A. 28) and finishes the 



proof. 



□ 



B. Gaussian integrals and linear algebra 
B.l. Partial Gaussian integrals 

This section is devoted to proof the representation for partial or incomplete Gaussian 
integrals. The formula (B.l) is an ingredient to evaluate the weighted transport cost in 
Section 3.3. 

Lemma B.l (Partial Gaussian integral). Let G S~^{n) be a symmetric positive 

definite matrix and let rj G S^~^ be a unit vector. Therewith, {'''V ~^ ^.^^ for 
Z"*" G spanjr;}"'" an affine subspace o/R". The integral of a centered Gaussian w.r.t. to 
this subspace is given by 

,(-lE-N + .-l)d. = -^exp(-E-M---l), 



In 
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Proof. To evaluate this integral on an one-dimensional subspace of R", we have to expand 
the quadratic form S~^[rr/ + Z"*-] and arrive at the relation 

J exp ^-^S"^[rr7 + z-^]^ dr 

= exp (^-Ie-^z^]) ^exp (^-^S-i[r?] +r (r?,S-iz^)^ dr 



2 ' 7v/E^ 'V ^^''t"'! 

5^ ^J_i(^-^_?:!i^],,^ 



which concludes the hypothesis. □ 



B.2. Subdeterminants, adjugates and inverses 

Let A e S'^{n), then define for 77 G S''*"^ the matrix 

A:=A-^^l^. (B.l) 

The matrix A has at least rank n — 1, since we subtracted from the positive definite 
matrix A a rank-1 matrix. Further, from the representation it is immediate, that A has 
rank n — 1 if and only if 77 is an eigenvector of A. In this case ker^l = spanry. It is easy 
to show that 

A> on span {r/}"*" . 

Let V = spanlry}"*" be the (n — l)-dimensional subspace perpendicular to 7]. Then for a 
matrix A £ S'^{n) we want to calculate the determinant of A restricted to this subspace 
V. This determinant is obtained by first choosing Q G 50" such that Q{{0} xR"^^) = V 
and then evaluating the determinant of the minor consisting of the (n — 1) x (n— 1) lower 
right submatrix of AQ denoted by deii^i{Q^ AQ) . Hence, we have 

deti,i(Q^^(5), with Q e SO{n) : Q^r/ = = (1, 0, . . . , 0)^. 

Since V = spanjry}^, it follows that the first column of Q is given by 77 and we can 
decompose AQ into 

\Q'^Ai] Q^AQ) 

where for a matrix M, M is the lower right (n — 1) x (n — 1) submatrix of M and for a 
vector the {n—X) lower subvector oiv. Therewith, we find a similarity transformation 
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which apphed to AQ results in 

det^ = detQT^Q = det (_^T ^ ']{ Mv] 

\\Q^Ar] Q^AQJ \0 Id„_i ^ 



det 



The determinant of the minor is given by 

deti,, AQ — J = deti, [A - J Q 

Hence, by the definition (B.l) of A and the subdeterminant we found the identity 

det^ = A[r/]deti,i(Q^^Q). (B.2) 

B.3. A matrix optimization 

Lemma B.2. Let B € S^{n — 1), then it holds 

inf \—^^=:2A>B\=Vd^ 
A(^s+{n-i) y Y^det {2A - B) J 

and for the optimal A holds A = B. 
Proof. We note that 

det ^ _ 1 
Vdet i2A - B) ^det(^-i) det (2 Id -^"s^A-i) 

Therewith, it is enough to maximize the radical of the root. Therefore, we substitute 
A~2 = CB~2 with C > not necessarily symmetric and observe that A~^ = B~2C^. 
We obtain 

det(A-i)det(2Id-A-is^-5) = det(B-i) det(C7C7^) det(2 Id -C7C^). 
Note, that CC^ G S^{n) and it is enough to calculate 

sup |det(C') det(2 Id -C) : C < 2 Id) . 

From the constraint < C < 2Id we can write C = Id+D, where D is symmetric and 
satisfies — Id < < Id in the sense of quadratic forms. From here, we finally observe 

det(C) det(2 Id -C) = det(Id +D) det(Id -D) = det(Id -D^). 

Since > 0, we find the optimal C given by Id, which yields that A = B. □ 
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B. 4. Jacobi's formula 

Lemma B.3 (Jacobi's formula). Let R B t ^ <Pt ^ {A e E"""" : det A / 0} be a differ- 
entiable function, then 

^Indet^t = tr($,-^6t) . (B.3) 

Proof. We first note that the determinant of <^(t) is a multilinear function d of the 
columns cj)], . . . , 0", i.e. det $t = d{(j)j, Then, it follows 

^ det = di<pj,cPl . . . , <^«) + . . . + d(<^i, . . . , <Ar\ cAD- 

Now, the proof consists of two steps. We first proof the identity (B.3) for $t = Id and 
then generalize this result. If we assume w.l.o.g. that = Id. By expanding the 
determinant d{<pl, (/>|, . . . , (/>") along its first column it immediately follows that 

a!(<A^<A?,...,</>r) = 4'''- 

From here we conclude that 

— det = tr$t. 
at 

Now, let $j = A be a general invertible matrix. Hence, we can apply the result from the 
first step to A~^^t and arrive at 

^det (A-i$t) = tr (^-1$^) . 
The results follows by substituting A back. □ 

C. Existence of a spectral gap for L 

In this short part of the appendix, we state a standard argument that ensures that the 
conditions (Al) and (A2) imply a spectral gap. More precisely, we make the following 
definition of the spectral gap 

Definition C.l. We say that the operator L = eA — VH ■ V given by (1.1) satisfies a 
spectral gap of order g > i.e. SG(^>), if L has discrete non-negative spectrum and the 
order eigenvalues Aj satisfy 

Ao = < Al < A2 < . . . and Ai > g. 

We take over the argument of [Kun02, Proposition 3.7] and adapt it to the case of small 
noise e. 
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Proposition C.2. Assume that H satisfies (Al) and (A2) with constants Ch,Kh > 0. 
Then for all e < wi 
satisfies SG{g) for some g > . 



1 

Then for all e < 2 2c+Kh+C^ with c > the operator L = eA — V// • V given by (1.1) 



Proof. The operator L = eA — VH ■ V on -L^(^) can be transformed into a Schrodinger 
operator with potential 

Ls :=-eA + ^\VHf -^AH 

on L'^{dx). This can be seen by using the unitary transformation U : L^(da;) — )■ 
given by / I— 7- exp(^)/. So, by partial integration we get 

£{Uf, Ug)=e jvf-Vg + ^\VHffg + ^Vi/ • {fVg + gVf) dx 

= JeVf-Vg+ilJVHl^-^AH) fg dx = j {Lsf)g dx. 

For the Schrodinger operator Ls it is known (see e.g. [BS91, Theorem 3.1]) that 

liminf (^|Vi7p - |Ai7) > (C.l) 

is a sufficient condition to have a discrete spectrum on (— 00, c) and in addition for every 
c' < c and C' < c« to have a finite spectrum on (— C", c'). The condition (C.l) is implied 

by the assumption (A2) for e < 2 2c+Kh+C'^ ^^^^ proof of Lemma 4.6 for a similar 
argument). Since the transformation U was unitary Ls and —L have the same spectrum. 
Hence, L has also a discrete spectrum on (— c, 00) and as it is a non-negative operator 
it has a discrete spectrum on [0, 00) which is finite on [0, C) for every C < 00, which 
implies a spectral gap. □ 
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D. Conventions and notation 



\JLtJU 


n.-dinicnsional Lebesgue measure 


H''{dx) 


A;-dimensional Hausdorff measure 




density of of the probability measure fi 


III fix) 


short form for /x(*)d.x 




conditional measure obtained from /i by conditioning on y 


z 


a generic partition sum normalizing a probability measure 


SUpp jJL 


the support of supp ji = {//(x) > 0} 






/XLS 4 


restriction of /i onto C supp/i, (/iLr2)(dx) := "^^^"^ /i(a;)dx 


%M 


push-forward measure under 4> : R" — > W D U, (a>{j//)((7) = /i(4> [U)) 




matrix transpose 


tr A 


trace of A 


{x,y} =x-y 


buclidean scalar product between x, y € K 




Euclidean norm 




testmg a quadratic torm A[x\ = {x, Ax) 




QcLClllililcxilL Oi Lllc llidLllX /I Willi fc-lll lUW cLllCl j-lll CUlUlllll IclllUVcCl 




/l[rl < for all T ^ IR^ 




spf, of svTTiTTiptvir Dositivp rlpfiTiitp matTirps i p A = aTirl /4 ^ 


Qn-l 


Lllllt o^lici C 111 JLV J l.C 1// ^ ILX. • O/ — ±r 


<(n,9) 


geodesic on S^~^ between r],6 G S^~^ 


SO{n) 


rotational matrices in R", Q E R'''''': Q'^ = , detQ = 1. 


V^H 


Hessian of : R" ^ E 


V F 


the divergence of F : R" ->■ R" 


ipt 


derivative in parameter t G R 


u{e) 


1 

a smooth monotone decreasing function satisfying > Ine 2 for £ < 1 


< > 


=) <5 ^ up to multiplicative error of the form 1 + 0{y/e u^(e)). 


C,c 


constants only depending on the dimension n and H 




constants additionally depending on the function or variable / 
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